Structured cards
Model database
Filter by provider, architecture family, or full-text search across descriptions.
Frontier models
Verified flagship models across major providers, curated for operational relevance.
OpenAI
GPT-5.5
FrontierLatestOpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.
Gemini 2.5 Pro
FrontierLatestGoogle's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.
Anthropic
Claude Opus 4.7
FrontierLatestAnthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.
xAI
Grok 4.3
FrontierLatestxAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.
Mistral AI
Mistral Medium 3.5
FrontierLatestMistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.
DeepSeek
DeepSeek-V3.2
FrontierLatestDeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.
Meta
Llama 3.1 405B Instruct
FrontierLatestMeta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.
Cohere
Command R+
FrontierLatestCohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.
All models
Filter and paginate the full catalog. Tabs control lifecycle scope.
Gemini 2.5 Pro
CurrentLatestGoogle's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.
OpenAI
GPT-5.5
CurrentLatestOpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.
Anthropic
Claude Opus 4.7
CurrentLatestAnthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.
Anthropic
Claude Sonnet 4.6
CurrentLatestAnthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.
Anthropic
Claude 3.5 Sonnet
CurrentLatestAnthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.
DeepSeek
DeepSeek-R1
CurrentLatestDeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.
Alibaba
Qwen 2.5 72B Instruct
CurrentLatestQwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.
Meta
Llama 3.1 405B Instruct
CurrentLatestMeta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.
xAI
Grok-2
CurrentLatestGrok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.
Cohere
Command R+
CurrentLatestCohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.
xAI
Grok 4.3
CurrentLatestxAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.
Mistral AI
Mistral Medium 3.5
CurrentLatestMistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.
Mistral AI
Mistral Large 3
CurrentLatestMistral's open-weight general-purpose multimodal model listed in official Mistral model documentation.
DeepSeek
DeepSeek-V3.2
CurrentLatestDeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.
Gemini 2.5 Flash-Lite
CurrentLatestGoogle's fastest and most budget-friendly multimodal model in the Gemini 2.5 family, according to the Gemini API model documentation.
Gemini 2.0 Flash
CurrentLatestGemini 2.0 Flash is Google’s efficiency-oriented multimodal model generation aimed at fast agentic and interactive experiences. Capabilities and naming evolve—validate against the current Gemini API reference for tool use and context limits.
Microsoft
Phi-4
CurrentLatestPhi-4 is Microsoft Research’s small language model line focused on strong reasoning per parameter for on-device and low-cost cloud scenarios. Deployment often happens via Azure AI or Hugging Face hubs—confirm license for your channel.
Mistral AI
Mistral Small 3
CurrentLatestMistral Small 3 is Mistral’s efficiency tier for fast, affordable chat and tool use at high QPS—positioned between tiny open models and Mistral Large. Exact naming and versioning appear in Mistral’s API catalog; pin versions in production.
Mistral AI
Mistral 7B Instruct v0.3
CurrentLatestMistral 7B Instruct is a compact dense model that popularized efficient open-weight chat quality at small scale. It remains a baseline for fine-tunes and on-prem pilots where 13B+ models are too heavy.
Meta
Llama 3.1 8B Instruct
CurrentLatestLlama 3.1 8B Instruct is a small open-weights model for edge laptops, single-GPU servers, and ultra-low-latency assistants. Quality per dollar is competitive for simple tasks but not for frontier reasoning.
Meta
Llama 3.1 70B Instruct
CurrentLatestLlama 3.1 70B Instruct is a mid-size open-weights instruct model balancing quality and deployability on a single large GPU or small multi-GPU nodes. Common for private assistants, on-prem pilots, and fine-tunes where 405B is impractical.
OpenAI
GPT-3.5 Turbo
CurrentLatestGPT-3.5 Turbo is a long-standing cost-efficient chat model family on the OpenAI API for simple assistants, classification, and legacy integrations. Many teams still use it for non-critical paths or as a fallback when newer models are rate-limited.
Gemini 1.0 Pro
CurrentLatestGemini 1.0 Pro represents Google’s first broadly marketed Gemini-era general model for text and basic multimodal tasks on Vertex and consumer surfaces. New projects should prefer 1.5+ generations unless constrained by legacy integrations—verify availability.
Anthropic
Claude 3.5 Haiku
CurrentLatestClaude 3.5 Haiku is Anthropic’s fast, cost-efficient tier for high-volume classification, routing, and simple chat. It targets latency-sensitive paths and agent pre-processing before escalating to Sonnet-class models.
Recommended
Top current models by information quality score — good defaults when you are not sure where to start.
Anthropic
Claude Sonnet 4.6
CurrentLatestAnthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.
Anthropic
Claude 3.5 Sonnet
CurrentLatestAnthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.
DeepSeek
DeepSeek-R1
CurrentLatestDeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.
Alibaba
Qwen 2.5 72B Instruct
CurrentLatestQwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.
xAI
Grok-2
CurrentLatestGrok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.
Missing a frontier release? Add a model (editors)