Anthropic

AnthropicVision-2024

multimodal · Release Mar 30, 2024 · License n/a

A multimodal model that combines text and image inputs for comprehension and response generation.

multimodal · vision


Modalities

What goes in and what comes out.

Inputs

text, image

Outputs

text, image

Capabilities

image captioning, text/image interaction, visual question answering

Benchmarks snapshot

Structured JSON for reproducible comparisons.

{}

Related on GenAIWiki

Same provider, tooling that cites the model, or prompts tuned for it.

Anthropic

Claude 3 Opus

Claude 3 Opus is a conversational model with a broad understanding of context and intent, featuring a 200K-token context window for long dialogues.

Anthropic

Claude 3.5 Sonnet

Balanced capability model emphasizing steerability, long-context reasoning, and safer default behaviors for agentic workflows.