GenAIWiki

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

OpenAI

GPT-5.5

FrontierLatest

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated today
frontierreasoning

Google

Gemini 2.5 Pro

FrontierLatest

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated today
frontiergoogle

Anthropic

Claude Opus 4.7

FrontierLatest

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated today
frontierreasoning

xAI

Grok 4.3

FrontierLatest

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated today
frontierxai

Mistral AI

Mistral Medium 3.5

FrontierLatest

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated today
frontiermistral

DeepSeek

DeepSeek-V3.2

FrontierLatest

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated today
frontierdeepseek

Meta

Llama 3.1 405B Instruct

FrontierLatest

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 5 weeks ago
open-weightsself-host

Cohere

Command R+

FrontierLatest

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 5 weeks ago
enterpriserag

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Microsoft

Phi-3 Medium

LegacyLatest

Phi-3 Medium is a compact instruct model aimed at strong quality per parameter for on-device and cost-sensitive cloud inference. It competes with other SLMs on coding and reasoning benchmarks—validate on your domain prompts.

Updated 5 weeks ago
edgeopen-weights

Cohere

Command R

LegacyLatest

Command R is Cohere’s earlier RAG-oriented model line preceding Command R+, focused on grounded generation with connectors and multilingual enterprise search. Useful when comparing tiered Cohere stacks or maintaining legacy integrations.

Updated 5 weeks ago
enterpriserag

01.AI

Yi-34B

LegacyLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Microsoft

Phi-3 Mini

LegacyLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

TII

Falcon 180B

LegacyLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

DeepSeek

DeepSeek-V3

Legacy

DeepSeek-V3 is a large-scale language model family noted for strong coding and math performance under open or research-friendly terms (verify the exact license for your deployment). Teams adopt it for cost-sensitive research, self-hosted inference, or comparison against frontier APIs.

FeaturedUpdated today
researchcoding

Mistral AI

Mistral Large 2

Legacy

Mistral’s frontier-class multilingual model emphasizing JSON adherence, agent-friendly behavior, and competitive reasoning within the Mistral API ecosystem. European teams often evaluate it for GDPR-adjacent deployment patterns alongside US-hosted alternatives.

FeaturedUpdated today
euenterprise

Google

Gemini 1.5 Pro

Legacy

Google DeepMind Gemini 1.5 Pro targets long-context multimodal workloads—large effective context for retrieval-heavy document pipelines, plus image, audio, and video inputs on supported surfaces. It is often paired with Vertex AI or the Gemini API for enterprise workloads on GCP.

FeaturedUpdated today
googlelong-context

OpenAI

GPT-4o

Legacy

OpenAI’s flagship multimodal chat model for production assistants: native image and audio inputs, strong tool and JSON-mode behavior, and low-latency routing on the Chat Completions API. Teams use it for vision-heavy workflows, agent loops with parallel tools, and structured extraction where schema adherence matters.

FeaturedUpdated today
frontiermultimodal

xAI

Grok-3

Legacy

Grok-3 represents xAI’s newer generation aimed at stronger reasoning and tool use versus Grok-2. Capabilities and rollout are version-specific—validate against xAI documentation for your account tier.

Updated today
frontierapi

Mistral AI

Mixtral 8x7B Instruct

Legacy

Mixtral 8x7B Instruct is a sparse mixture-of-experts open model noted for strong quality per active parameter and efficient inference vs dense models of similar capability. Widely hosted on inference clouds and self-hosted stacks.

Updated 5 weeks ago
open-weightsmoe

Meta

Llama 3.2 3B Instruct

Legacy

Llama 3.2 3B Instruct is a compact instruct model in Meta’s 3.2 generation aimed at mobile and edge scenarios with multilingual support on supported checkpoints. Verify hardware targets and license terms for your distribution channel.

Updated 5 weeks ago
edgeslm

Meta

Llama 3.2 1B Instruct

Legacy

Llama 3.2 1B Instruct is among the smallest Llama instruct checkpoints for extreme latency and footprint constraints. Use for routing, tagging, and toy assistants—not for complex reasoning without retrieval augmentation.

Updated 5 weeks ago
edgetiny

OpenAI

GPT-4o mini

Legacy

GPT-4o mini is a cost-optimized GPT-4o-family model for high-volume chat, moderation, and routing layers where frontier quality is unnecessary. It supports multimodal inputs on supported API surfaces and is often used as a fast first pass before escalating to larger models.

Updated 5 weeks ago
latencycost

OpenAI

GPT-4 Turbo

Legacy

GPT-4 Turbo is a widely deployed GPT-4-class chat model with a large context window on the OpenAI API, aimed at long-document workflows, retrieval bundles, and production assistants that do not require GPT-4o’s multimodal stack. It remains a common baseline for cost/quality tradeoffs.

Updated 5 weeks ago
general-purposeapi

Google

Gemini 1.5 Flash

Legacy

Gemini 1.5 Flash targets low-latency, cost-efficient multimodal chat and retrieval workloads on the Gemini API and Vertex AI. It keeps much of the long-context family behavior with faster responses for interactive apps.

Updated 5 weeks ago
googlelatency

Anthropic

Claude 3 Opus

Legacy

Claude 3 Opus was Anthropic’s highest-capability Claude 3-era model for difficult reasoning, nuanced writing, and complex analysis before later Sonnet generations. Teams still reference it for historical benchmarks and legacy deployments—verify current availability in API and Bedrock model lists.

Updated 5 weeks ago
frontierwriting

Anthropic

Claude 3 Sonnet

Legacy

Claude 3 Sonnet balanced cost and capability in the Claude 3 generation—useful for general assistants and document workflows where Opus was unnecessary. New deployments should compare against Claude 3.5 Sonnet for pricing and quality.

Updated 5 weeks ago
general-purposelegacy

Anthropic

Claude 3 Haiku

Legacy

Claude 3 Haiku is the fast Claude 3-era model for simple tasks and high throughput. Prefer 3.5 Haiku when available for better quality at similar latency targets—confirm SKUs on your cloud path.

Updated 5 weeks ago
latencylegacy

Alibaba

Qwen 2

Legacy

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Meta

Llama 3 8B

Legacy

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Meta

Llama 3 70B

Legacy

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

OpenAI

GPT-4.1 nano

Legacy

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Anthropic

Claude Opus 4.5

Legacy

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Anthropic

Claude Sonnet 4.6

CurrentLatest

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated today
frontiersonnet

Anthropic

Claude 3.5 Sonnet

CurrentLatest

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 3 weeks ago
codingagents

DeepSeek

DeepSeek-R1

CurrentLatest

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 3 weeks ago
reasoningresearch

Alibaba

Qwen 2.5 72B Instruct

CurrentLatest

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 3 weeks ago
open-weightsmultilingual

xAI

Grok-2

CurrentLatest

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 5 weeks ago
frontierapi

Missing a frontier release? Add a model (editors)