Structured cards
Model database
Filter by provider, architecture family, or full-text search across descriptions.
Frontier models
Verified flagship models across major providers, curated for operational relevance.
OpenAI
GPT-5.5
FrontierLatestOpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.
Gemini 2.5 Pro
FrontierLatestGoogle's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.
Anthropic
Claude Opus 4.7
FrontierLatestAnthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.
xAI
Grok 4.3
FrontierLatestxAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.
Mistral AI
Mistral Medium 3.5
FrontierLatestMistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.
DeepSeek
DeepSeek-V3.2
FrontierLatestDeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.
Meta
Llama 3.1 405B Instruct
FrontierLatestMeta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.
Cohere
Command R+
FrontierLatestCohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.
All models
Filter and paginate the full catalog. Tabs control lifecycle scope.
Anthropic
Claude Opus 4.6
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Anthropic
Claude Sonnet 4.5
LegacyCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Recommended
Top current models by information quality score — good defaults when you are not sure where to start.
Anthropic
Claude Sonnet 4.6
CurrentLatestAnthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.
Anthropic
Claude 3.5 Sonnet
CurrentLatestAnthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.
DeepSeek
DeepSeek-R1
CurrentLatestDeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.
Alibaba
Qwen 2.5 72B Instruct
CurrentLatestQwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.
xAI
Grok-2
CurrentLatestGrok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.
Missing a frontier release? Add a model (editors)