AI Models Directory

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

Anthropic

Claude Fable 5

Anthropic's highest-capability widely released Claude model, documented for deep reasoning, codebase-scale work, long-context enterprise workloads, and multimodal inputs.

FeaturedUpdated 3 weeks ago

frontierclaude

OpenAI

GPT-5.5

FrontierLatest

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated 3 weeks ago

frontierreasoning

Microsoft AI

MAI-Thinking-1

FrontierLatest

Microsoft AI's frontier reasoning model in the MAI family, announced for difficult prompts, science, math, and complex planning workloads, with Microsoft Foundry access documented as private preview.

FeaturedUpdated 3 weeks ago

frontiermicrosoft

Google

Gemini 2.5 Pro

FrontierLatest

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated 3 weeks ago

frontiergoogle

Anysphere

Cursor Composer 2.5

FrontierLatest

Cursor's Composer 2.5 model, documented in the official Cursor changelog as a coding model used in the Auto routing flow.

FeaturedUpdated 3 weeks ago

codingcursor

xAI

Grok 4.3

FrontierLatest

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated 4 weeks ago

frontierxai

Mistral AI

Mistral Medium 3.5

FrontierLatest

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated 4 weeks ago

frontiermistral

DeepSeek

DeepSeek-V3.2

FrontierLatest

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated 4 weeks ago

frontierdeepseek

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Microsoft AI

MAI-Voice-2-Flash

PreviewLatest

Microsoft AI's announced faster MAI voice variant for lower-latency text-to-speech workflows.

Updated 3 weeks ago

speechvoice

Anthropic

Claude Mythos Preview

Preview

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

frontier

Anthropic

Claude Opus 4.7

Legacy

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated 4 weeks ago

frontierreasoning

OpenAI

o1

Legacy

OpenAI’s o1 series emphasizes extended internal reasoning before answering—useful for competition-style math, complex debugging, and multi-step planning where latency is acceptable. It behaves differently from standard chat models: tune prompts for chain-of-thought style tasks and measure time-to-first-token.

FeaturedUpdated 4 weeks ago

reasoningstem

Mistral AI

Mistral Large 2

Legacy

Mistral’s frontier-class multilingual model emphasizing JSON adherence, agent-friendly behavior, and competitive reasoning within the Mistral API ecosystem. European teams often evaluate it for GDPR-adjacent deployment patterns alongside US-hosted alternatives.

FeaturedUpdated 4 weeks ago

euenterprise

xAI

Grok-2

Legacy

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 4 weeks ago

frontierapi

OpenAI

GPT-4o

Legacy

OpenAI’s flagship multimodal chat model for production assistants: native image and audio inputs, strong tool and JSON-mode behavior, and low-latency routing on the Chat Completions API. Teams use it for vision-heavy workflows, agent loops with parallel tools, and structured extraction where schema adherence matters.

FeaturedUpdated 4 weeks ago

frontiermultimodal

Google

Gemini 1.5 Pro

Legacy

Google DeepMind Gemini 1.5 Pro targets long-context multimodal workloads—large effective context for retrieval-heavy document pipelines, plus image, audio, and video inputs on supported surfaces. It is often paired with Vertex AI or the Gemini API for enterprise workloads on GCP.

FeaturedUpdated 4 weeks ago

googlelong-context

DeepSeek

DeepSeek-V3

Legacy

DeepSeek-V3 is a large-scale language model family noted for strong coding and math performance under open or research-friendly terms (verify the exact license for your deployment). Teams adopt it for cost-sensitive research, self-hosted inference, or comparison against frontier APIs.

FeaturedUpdated 4 weeks ago

researchcoding

Anthropic

Claude 3.5 Sonnet

Legacy

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 4 weeks ago

codingagents

Microsoft

Phi-3 Medium

Legacy

Phi-3 Medium is a compact instruct model aimed at strong quality per parameter for on-device and cost-sensitive cloud inference. It competes with other SLMs on coding and reasoning benchmarks—validate on your domain prompts.

Updated 4 weeks ago

edgeopen-weights

Mistral AI

Mixtral 8x7B Instruct

Legacy

Mixtral 8x7B Instruct is a sparse mixture-of-experts open model noted for strong quality per active parameter and efficient inference vs dense models of similar capability. Widely hosted on inference clouds and self-hosted stacks.

Updated 4 weeks ago

open-weightsmoe

Llama 3.2 3B Instruct

Legacy

Llama 3.2 3B Instruct is a compact instruct model in Meta’s 3.2 generation aimed at mobile and edge scenarios with multilingual support on supported checkpoints. Verify hardware targets and license terms for your distribution channel.

Updated 4 weeks ago

edgeslm

Llama 3.2 1B Instruct

Legacy

Llama 3.2 1B Instruct is among the smallest Llama instruct checkpoints for extreme latency and footprint constraints. Use for routing, tagging, and toy assistants—not for complex reasoning without retrieval augmentation.

Updated 4 weeks ago

edgetiny

xAI

Grok-3

Legacy

Grok-3 represents xAI’s newer generation aimed at stronger reasoning and tool use versus Grok-2. Capabilities and rollout are version-specific—validate against xAI documentation for your account tier.

Updated 4 weeks ago

frontierapi

OpenAI

GPT-4o mini

Legacy

GPT-4o mini is a cost-optimized GPT-4o-family model for high-volume chat, moderation, and routing layers where frontier quality is unnecessary. It supports multimodal inputs on supported API surfaces and is often used as a fast first pass before escalating to larger models.

Updated 4 weeks ago

latencycost

OpenAI

GPT-4 Turbo

Legacy

GPT-4 Turbo is a widely deployed GPT-4-class chat model with a large context window on the OpenAI API, aimed at long-document workflows, retrieval bundles, and production assistants that do not require GPT-4o’s multimodal stack. It remains a common baseline for cost/quality tradeoffs.

Updated 4 weeks ago

general-purposeapi

OpenAI

GPT-3.5 Turbo

Legacy

GPT-3.5 Turbo is a long-standing cost-efficient chat model family on the OpenAI API for simple assistants, classification, and legacy integrations. Many teams still use it for non-critical paths or as a fallback when newer models are rate-limited.

Updated 4 weeks ago

legacycost

Google

Gemini 1.5 Flash

Legacy

Gemini 1.5 Flash targets low-latency, cost-efficient multimodal chat and retrieval workloads on the Gemini API and Vertex AI. It keeps much of the long-context family behavior with faster responses for interactive apps.

Updated 4 weeks ago

googlelatency

Google

Gemini 1.0 Pro

Legacy

Gemini 1.0 Pro represents Google’s first broadly marketed Gemini-era general model for text and basic multimodal tasks on Vertex and consumer surfaces. New projects should prefer 1.5+ generations unless constrained by legacy integrations—verify availability.

Updated 4 weeks ago

googlelegacy

Cohere

Command R

Legacy

Command R is Cohere’s earlier RAG-oriented model line preceding Command R+, focused on grounded generation with connectors and multilingual enterprise search. Useful when comparing tiered Cohere stacks or maintaining legacy integrations.

Updated 4 weeks ago

enterpriserag

Anthropic

Claude 3 Opus

Legacy

Claude 3 Opus was Anthropic’s highest-capability Claude 3-era model for difficult reasoning, nuanced writing, and complex analysis before later Sonnet generations. Teams still reference it for historical benchmarks and legacy deployments—verify current availability in API and Bedrock model lists.

Updated 4 weeks ago

frontierwriting

Anthropic

Claude 3 Sonnet

Legacy

Claude 3 Sonnet balanced cost and capability in the Claude 3 generation—useful for general assistants and document workflows where Opus was unnecessary. New deployments should compare against Claude 3.5 Sonnet for pricing and quality.

Updated 4 weeks ago

general-purposelegacy

Anthropic

Claude 3 Haiku

Legacy

Claude 3 Haiku is the fast Claude 3-era model for simple tasks and high throughput. Prefer 3.5 Haiku when available for better quality at similar latency targets—confirm SKUs on your cloud path.

Updated 4 weeks ago

latencylegacy

Sarvam 30B is a 30B parameter Mixture-of-Experts chat and reasoning model from Sarvam AI, optimized for Indian languages, real-time conversation, high-throughput voice-agent pipelines, coding, and practical deployment. Sarvam documents 2.4B active parameters per token, 16T tokens of pre-training data, a 64K context window, Grouped Query Attention, Apache 2.0 open weights, and OpenAI-compatible chat completions.

Updated 10 days ago

sarvamindian languages

Sarvam AI

Sarvam 105B

CurrentLatest

Sarvam 105B is Sarvam AI's flagship 105B+ parameter Mixture-of-Experts reasoning model for Indian-language and English chat, complex reasoning, coding, long-context document analysis, and agentic tool-use workflows. Sarvam documents it as a 128K-context OpenAI-compatible chat model with Multi-head Latent Attention, 12T tokens of pre-training data, Apache 2.0 open weights, and production use powering Indus. Its strongest fit is Indian-language enterprise assistants, multilingual reasoning, and agent workflows where native script, romanized, and code-mixed inputs matter.

Updated 10 days ago

sarvamindian languages

Anthropic

Claude Opus 4.8

CurrentLatest

Anthropic's current Opus-tier Claude model, documented for complex reasoning, coding, and multimodal enterprise workloads below the newer Fable tier.

FeaturedUpdated 3 weeks ago

frontierclaude

OpenAI

GPT-5.4

CurrentLatest

OpenAI's GPT-5.4 model, documented in the official OpenAI API model guide as part of the current GPT-5 family below the GPT-5.5 flagship lane.

FeaturedUpdated 3 weeks ago

frontieropenai

Stability AI

Stable Diffusion XL

CurrentLatest

Stable Diffusion XL (SDXL) 1.0 is Stability AI's latent diffusion text-to-image model for native 1024x1024 generation. The base model can run standalone or feed an optional refiner for the final denoising steps, and the published weights support self-hosted Diffusers workflows.

Updated 4 weeks ago

imageopen-weights

Alibaba

Qwen 2.5 72B Instruct

CurrentLatest

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 4 weeks ago

open-weightsmultilingual

Missing a frontier release? Add a model (editors)

Model database

Frontier models

Claude Fable 5

GPT-5.5

MAI-Thinking-1

Gemini 2.5 Pro

Cursor Composer 2.5

Grok 4.3

Mistral Medium 3.5

DeepSeek-V3.2

All models

MAI-Voice-2-Flash

Claude Mythos Preview

Claude Opus 4.7

o1

Mistral Large 2

Grok-2

GPT-4o

Gemini 1.5 Pro

DeepSeek-V3

Claude 3.5 Sonnet

Phi-3 Medium

Mixtral 8x7B Instruct

Llama 3.2 3B Instruct

Llama 3.2 1B Instruct

Grok-3

GPT-4o mini

GPT-4 Turbo

GPT-3.5 Turbo

Gemini 1.5 Flash

Gemini 1.0 Pro

Command R

Claude 3 Opus

Claude 3 Sonnet

Claude 3 Haiku

Recommended

Sarvam 30B

Sarvam 105B

Claude Opus 4.8

GPT-5.4

Stable Diffusion XL

Qwen 2.5 72B Instruct