GenAIWiki

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

OpenAI

GPT-5.5

FrontierLatest

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated today
frontierreasoning

Google

Gemini 2.5 Pro

FrontierLatest

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated today
frontiergoogle

Anthropic

Claude Opus 4.7

FrontierLatest

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated today
frontierreasoning

xAI

Grok 4.3

FrontierLatest

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated today
frontierxai

Mistral AI

Mistral Medium 3.5

FrontierLatest

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated today
frontiermistral

DeepSeek

DeepSeek-V3.2

FrontierLatest

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated today
frontierdeepseek

Meta

Llama 3.1 405B Instruct

FrontierLatest

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 5 weeks ago
open-weightsself-host

Cohere

Command R+

FrontierLatest

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 5 weeks ago
enterpriserag

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Google

Gemini 2.5 Pro

CurrentLatest

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated today
frontiergoogle

OpenAI

GPT-5.5

CurrentLatest

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated today
frontierreasoning

Anthropic

Claude Opus 4.7

CurrentLatest

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated today
frontierreasoning

Anthropic

Claude Sonnet 4.6

CurrentLatest

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated today
frontiersonnet

Anthropic

Claude 3.5 Sonnet

CurrentLatest

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 3 weeks ago
codingagents

DeepSeek

DeepSeek-R1

CurrentLatest

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 3 weeks ago
reasoningresearch

Alibaba

Qwen 2.5 72B Instruct

CurrentLatest

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 3 weeks ago
open-weightsmultilingual

Meta

Llama 3.1 405B Instruct

CurrentLatest

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 5 weeks ago
open-weightsself-host

xAI

Grok-2

CurrentLatest

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 5 weeks ago
frontierapi

Cohere

Command R+

CurrentLatest

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 5 weeks ago
enterpriserag

xAI

Grok 4.3

CurrentLatest

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated today
frontierxai

Mistral AI

Mistral Medium 3.5

CurrentLatest

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated today
frontiermistral

Mistral AI

Mistral Large 3

CurrentLatest

Mistral's open-weight general-purpose multimodal model listed in official Mistral model documentation.

Updated today
frontiermistral

DeepSeek

DeepSeek-V3.2

CurrentLatest

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated today
frontierdeepseek

Google

Gemini 2.5 Flash-Lite

CurrentLatest

Google's fastest and most budget-friendly multimodal model in the Gemini 2.5 family, according to the Gemini API model documentation.

Updated today
googleflash-lite

Google

Gemini 2.0 Flash

CurrentLatest

Gemini 2.0 Flash is Google’s efficiency-oriented multimodal model generation aimed at fast agentic and interactive experiences. Capabilities and naming evolve—validate against the current Gemini API reference for tool use and context limits.

Updated 3 weeks ago
googleagents

Microsoft

Phi-4

CurrentLatest

Phi-4 is Microsoft Research’s small language model line focused on strong reasoning per parameter for on-device and low-cost cloud scenarios. Deployment often happens via Azure AI or Hugging Face hubs—confirm license for your channel.

Updated 5 weeks ago
slmmicrosoft

Mistral AI

Mistral Small 3

CurrentLatest

Mistral Small 3 is Mistral’s efficiency tier for fast, affordable chat and tool use at high QPS—positioned between tiny open models and Mistral Large. Exact naming and versioning appear in Mistral’s API catalog; pin versions in production.

Updated 5 weeks ago
latencyapi

Mistral AI

Mistral 7B Instruct v0.3

CurrentLatest

Mistral 7B Instruct is a compact dense model that popularized efficient open-weight chat quality at small scale. It remains a baseline for fine-tunes and on-prem pilots where 13B+ models are too heavy.

Updated 5 weeks ago
open-weightsslm

Meta

Llama 3.1 8B Instruct

CurrentLatest

Llama 3.1 8B Instruct is a small open-weights model for edge laptops, single-GPU servers, and ultra-low-latency assistants. Quality per dollar is competitive for simple tasks but not for frontier reasoning.

Updated 5 weeks ago
edgeopen-weights

Meta

Llama 3.1 70B Instruct

CurrentLatest

Llama 3.1 70B Instruct is a mid-size open-weights instruct model balancing quality and deployability on a single large GPU or small multi-GPU nodes. Common for private assistants, on-prem pilots, and fine-tunes where 405B is impractical.

Updated 5 weeks ago
open-weightsself-host

OpenAI

GPT-3.5 Turbo

CurrentLatest

GPT-3.5 Turbo is a long-standing cost-efficient chat model family on the OpenAI API for simple assistants, classification, and legacy integrations. Many teams still use it for non-critical paths or as a fallback when newer models are rate-limited.

Updated 5 weeks ago
legacycost

Google

Gemini 1.0 Pro

CurrentLatest

Gemini 1.0 Pro represents Google’s first broadly marketed Gemini-era general model for text and basic multimodal tasks on Vertex and consumer surfaces. New projects should prefer 1.5+ generations unless constrained by legacy integrations—verify availability.

Updated 5 weeks ago
googlelegacy

Anthropic

Claude 3.5 Haiku

CurrentLatest

Claude 3.5 Haiku is Anthropic’s fast, cost-efficient tier for high-volume classification, routing, and simple chat. It targets latency-sensitive paths and agent pre-processing before escalating to Sonnet-class models.

Updated 5 weeks ago
latencycost

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Anthropic

Claude Sonnet 4.6

CurrentLatest

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated today
frontiersonnet

Anthropic

Claude 3.5 Sonnet

CurrentLatest

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 3 weeks ago
codingagents

DeepSeek

DeepSeek-R1

CurrentLatest

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 3 weeks ago
reasoningresearch

Alibaba

Qwen 2.5 72B Instruct

CurrentLatest

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 3 weeks ago
open-weightsmultilingual

xAI

Grok-2

CurrentLatest

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 5 weeks ago
frontierapi

Missing a frontier release? Add a model (editors)