Structured cards
Model database
Filter by provider, architecture family, or full-text search across descriptions.
Frontier models
Verified flagship models across major providers, curated for operational relevance.
OpenAI
GPT-5.5
FrontierLatestOpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.
Gemini 2.5 Pro
FrontierLatestGoogle's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.
Anthropic
Claude Opus 4.7
FrontierLatestAnthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.
xAI
Grok 4.3
FrontierLatestxAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.
Mistral AI
Mistral Medium 3.5
FrontierLatestMistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.
DeepSeek
DeepSeek-V3.2
FrontierLatestDeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.
Meta
Llama 3.1 405B Instruct
FrontierLatestMeta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.
Cohere
Command R+
FrontierLatestCohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.
All models
Filter and paginate the full catalog. Tabs control lifecycle scope.
OpenAI
DALL·E 3
CurrentDALL·E 3 is OpenAI’s instruction-aligned image generation model exposed via the Images API, emphasizing prompt adherence and safety classifiers for consumer and enterprise creative workflows. It targets marketing visuals, product mockups, and storyboarding rather than photorealistic deception.
AWS
Amazon Titan Text Premier
CurrentTitan Text Premier is AWS’s managed text model for Bedrock workloads emphasizing integration with guardrails, knowledge bases, and private data patterns. It targets enterprise RAG and internal assistants rather than frontier creative writing.
OpenAI
GPT-5.4
CurrentCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
DeepSeek
DeepSeek-V3
LegacyDeepSeek-V3 is a large-scale language model family noted for strong coding and math performance under open or research-friendly terms (verify the exact license for your deployment). Teams adopt it for cost-sensitive research, self-hosted inference, or comparison against frontier APIs.
Mistral AI
Mistral Large 2
LegacyMistral’s frontier-class multilingual model emphasizing JSON adherence, agent-friendly behavior, and competitive reasoning within the Mistral API ecosystem. European teams often evaluate it for GDPR-adjacent deployment patterns alongside US-hosted alternatives.
Gemini 1.5 Pro
LegacyGoogle DeepMind Gemini 1.5 Pro targets long-context multimodal workloads—large effective context for retrieval-heavy document pipelines, plus image, audio, and video inputs on supported surfaces. It is often paired with Vertex AI or the Gemini API for enterprise workloads on GCP.
OpenAI
GPT-4o
LegacyOpenAI’s flagship multimodal chat model for production assistants: native image and audio inputs, strong tool and JSON-mode behavior, and low-latency routing on the Chat Completions API. Teams use it for vision-heavy workflows, agent loops with parallel tools, and structured extraction where schema adherence matters.
xAI
Grok-3
LegacyGrok-3 represents xAI’s newer generation aimed at stronger reasoning and tool use versus Grok-2. Capabilities and rollout are version-specific—validate against xAI documentation for your account tier.
Mistral AI
Mixtral 8x7B Instruct
LegacyMixtral 8x7B Instruct is a sparse mixture-of-experts open model noted for strong quality per active parameter and efficient inference vs dense models of similar capability. Widely hosted on inference clouds and self-hosted stacks.
Meta
Llama 3.2 3B Instruct
LegacyLlama 3.2 3B Instruct is a compact instruct model in Meta’s 3.2 generation aimed at mobile and edge scenarios with multilingual support on supported checkpoints. Verify hardware targets and license terms for your distribution channel.
Meta
Llama 3.2 1B Instruct
LegacyLlama 3.2 1B Instruct is among the smallest Llama instruct checkpoints for extreme latency and footprint constraints. Use for routing, tagging, and toy assistants—not for complex reasoning without retrieval augmentation.
OpenAI
GPT-4o mini
LegacyGPT-4o mini is a cost-optimized GPT-4o-family model for high-volume chat, moderation, and routing layers where frontier quality is unnecessary. It supports multimodal inputs on supported API surfaces and is often used as a fast first pass before escalating to larger models.
OpenAI
GPT-4 Turbo
LegacyGPT-4 Turbo is a widely deployed GPT-4-class chat model with a large context window on the OpenAI API, aimed at long-document workflows, retrieval bundles, and production assistants that do not require GPT-4o’s multimodal stack. It remains a common baseline for cost/quality tradeoffs.
Gemini 1.5 Flash
LegacyGemini 1.5 Flash targets low-latency, cost-efficient multimodal chat and retrieval workloads on the Gemini API and Vertex AI. It keeps much of the long-context family behavior with faster responses for interactive apps.
Anthropic
Claude 3 Opus
LegacyClaude 3 Opus was Anthropic’s highest-capability Claude 3-era model for difficult reasoning, nuanced writing, and complex analysis before later Sonnet generations. Teams still reference it for historical benchmarks and legacy deployments—verify current availability in API and Bedrock model lists.
Anthropic
Claude 3 Sonnet
LegacyClaude 3 Sonnet balanced cost and capability in the Claude 3 generation—useful for general assistants and document workflows where Opus was unnecessary. New deployments should compare against Claude 3.5 Sonnet for pricing and quality.
Anthropic
Claude 3 Haiku
LegacyClaude 3 Haiku is the fast Claude 3-era model for simple tasks and high throughput. Prefer 3.5 Haiku when available for better quality at similar latency targets—confirm SKUs on your cloud path.
Alibaba
Qwen 2
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Meta
Llama 3 8B
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Meta
Llama 3 70B
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
OpenAI
GPT-4.1 nano
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Anthropic
Claude Opus 4.5
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Anthropic
Claude Opus 4.6
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Anthropic
Claude Sonnet 4.5
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Recommended
Top current models by information quality score — good defaults when you are not sure where to start.
Anthropic
Claude Sonnet 4.6
CurrentLatestAnthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.
Anthropic
Claude 3.5 Sonnet
CurrentLatestAnthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.
DeepSeek
DeepSeek-R1
CurrentLatestDeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.
Alibaba
Qwen 2.5 72B Instruct
CurrentLatestQwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.
xAI
Grok-2
CurrentLatestGrok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.
Missing a frontier release? Add a model (editors)