GENAIWIKI

Google

Gemini 1.5 Pro

Multimodal LLM · Release Feb 15, 2024 · Proprietary API

Google DeepMind multimodal family member with very large effective context for retrieval-heavy document workflows.

googlelong-contextenterprise
Updated today

Modalities

What goes in and what comes out.

Inputs

text, image, audio, video

Outputs

text

Capabilities

long context, retrieval, multilingual

Benchmarks snapshot

Structured JSON for reproducible comparisons.

{
  "mmmu": 73.4,
  "needle_in_haystack": "strong"
}

Related on GenAIWiki

Same provider, tooling that cites the model, or prompts tuned for it.