Anthropic
AnthropicVision-2024
multimodal · Release Mar 30, 2024 · License n/a
A multimodal model integrating text and image inputs for enhanced comprehension and response generation.
multimodal, vision
Modalities
What goes in and what comes out.
Inputs
text, image
Outputs
text, image
Capabilities
image captioning, text/image interaction, visual question answering
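As a rough illustration, the modalities and capabilities listed above could be captured in a small structured record and serialized to JSON for side-by-side comparisons. This is only a sketch: the `ModelCard` class, its field names, and the `accepts` helper are hypothetical and are not part of any GenAIWiki or Anthropic schema. The values are copied from this page as listed.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class ModelCard:
    """Hypothetical record mirroring the fields shown on this page."""
    name: str
    provider: str
    inputs: tuple
    outputs: tuple
    capabilities: tuple

    def accepts(self, modality: str) -> bool:
        # True if the card lists this modality as a supported input.
        return modality in self.inputs


# Values taken verbatim from the listing above.
card = ModelCard(
    name="AnthropicVision-2024",
    provider="Anthropic",
    inputs=("text", "image"),
    outputs=("text", "image"),
    capabilities=(
        "image captioning",
        "text/image interaction",
        "visual question answering",
    ),
)

# Tuples are emitted as JSON arrays; sorted keys keep the output stable.
card_json = json.dumps(asdict(card), indent=2, sort_keys=True)
print(card_json)
```

Keeping the record immutable and sorting keys on output is one way to make two snapshots of the same card easy to diff.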
Benchmarks snapshot
Structured JSON for reproducible comparisons.
{}
Related on GenAIWiki
Same provider, tooling that cites the model, or prompts tuned for it.
Anthropic
Claude 3 Opus
Claude 3 Opus is a conversational model with a broad understanding of context and intent; its 200k-token context window supports long dialogues and large documents.
Anthropic
Claude 3.5 Sonnet
Balanced capability model emphasizing steerability, long-context reasoning, and safer default behaviors for agentic workflows.