GenAIWiki

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

OpenAI

GPT-5.5

FrontierLatest

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated today
frontierreasoning

Google

Gemini 2.5 Pro

FrontierLatest

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated today
frontiergoogle

Anthropic

Claude Opus 4.7

FrontierLatest

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated today
frontierreasoning

xAI

Grok 4.3

FrontierLatest

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated today
frontierxai

Mistral AI

Mistral Medium 3.5

FrontierLatest

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated today
frontiermistral

DeepSeek

DeepSeek-V3.2

FrontierLatest

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated today
frontierdeepseek

Meta

Llama 3.1 405B Instruct

FrontierLatest

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 5 weeks ago
open-weightsself-host

Cohere

Command R+

FrontierLatest

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 5 weeks ago
enterpriserag

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Mistral AI

Mixtral 8x22B

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Community

LLaVA

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

xAI

Grok 1.5

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

OpenAI

GPT-5.4 nano

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

OpenAI

GPT-5.4 mini

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

DeepSeek

DeepSeek Coder V2

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Databricks

DBRX

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

Anthropic

Claude Haiku 4.5

CurrentLatest

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago
frontier

OpenAI

o1

Current

OpenAI’s o1 series emphasizes extended internal reasoning before answering—useful for competition-style math, complex debugging, and multi-step planning where latency is acceptable. It behaves differently from standard chat models: tune prompts for chain-of-thought style tasks and measure time-to-first-token.

FeaturedUpdated 5 weeks ago
reasoningstem

AWS

Amazon Nova

Current

Amazon Nova is AWS’s multimodal foundation model family for text, image, and video workloads delivered through Amazon Bedrock with enterprise IAM, VPC, and governance patterns. Model IDs and modalities vary by region—verify Bedrock model access lists.

FeaturedUpdated 5 weeks ago
awsbedrock

OpenAI

o3-mini

Current

Compact reasoning-focused model in OpenAI’s o-series line aimed at strong STEM and coding performance with lower cost than full o3. Intended for developers who want reasoning without always paying flagship prices—confirm exact API availability and snapshot names in OpenAI docs.

Updated 3 weeks ago
reasoningstem

OpenAI

Whisper large-v3

Current

Whisper large-v3 is OpenAI’s ASR model for transcription and translation across many languages, with strong robustness to accents and noise. It is commonly self-hosted or used via API partners; latency depends heavily on hardware and chunking strategy.

Updated 5 weeks ago
audioopen-weights

OpenAI

text-embedding-3-large

Current

text-embedding-3-large produces high-dimensional text embeddings for semantic search, clustering, and classification. Teams pair it with pgvector or SaaS vector DBs for RAG; output dimensions can be reduced with tradeoffs described in OpenAI documentation.

Updated 5 weeks ago
embeddingsretrieval

Stability AI

Stable Diffusion XL

Current

SDXL is a latent diffusion backbone for high-resolution image generation with broad community tooling (LoRA, ControlNet) and OpenRAIL-style licensing. It is typically self-hosted or run via GPU marketplaces rather than a single proprietary chat API.

Updated 5 weeks ago
imageopen-weights

Snowflake

Snowflake Arctic

Current

Snowflake Arctic is an enterprise-oriented open model emphasizing efficient training recipes and SQL-adjacent enterprise tasks inside the Snowflake ecosystem. It targets teams that want LLM features colocated with governed data in Snowflake Cortex.

Updated 5 weeks ago

OpenAI

o1-mini

Current

A smaller, faster o1-class model for STEM-style reasoning where full o1 latency or cost is prohibitive. Use it when you need better chain-of-thought style behavior than GPT-4o mini but not full o1 depth.

Updated 5 weeks ago
reasoningcost

NVIDIA

NVIDIA Nemotron-4 340B

Current

NVIDIA Nemotron-4 340B is a large open-weights model suite aimed at enterprise and research users who train and serve on NVIDIA stacks (NeMo, NGC). It targets GPU-native teams that need customizable checkpoints with NVIDIA-optimized tooling.

Updated 5 weeks ago
open-weightsenterprise

Google

Gemma 2 27B

Current

Gemma 2 27B is Google’s open-weights Gemma family checkpoint balancing quality and deployability for research and product teams that need permissive terms without Vertex-only APIs. It is often fine-tuned for domain tasks on TPU or GPU clusters.

Updated 5 weeks ago
open-weightsgoogle

OpenAI

DALL·E 3

Current

DALL·E 3 is OpenAI’s instruction-aligned image generation model exposed via the Images API, emphasizing prompt adherence and safety classifiers for consumer and enterprise creative workflows. It targets marketing visuals, product mockups, and storyboarding rather than photorealistic deception.

Updated 5 weeks ago
imagecreative

AWS

Amazon Titan Text Premier

Current

Titan Text Premier is AWS’s managed text model for Bedrock workloads emphasizing integration with guardrails, knowledge bases, and private data patterns. It targets enterprise RAG and internal assistants rather than frontier creative writing.

Updated 5 weeks ago
awsbedrock

OpenAI

GPT-5.4

Current

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated today
frontier

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Anthropic

Claude Sonnet 4.6

CurrentLatest

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated today
frontiersonnet

Anthropic

Claude 3.5 Sonnet

CurrentLatest

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 3 weeks ago
codingagents

DeepSeek

DeepSeek-R1

CurrentLatest

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 3 weeks ago
reasoningresearch

Alibaba

Qwen 2.5 72B Instruct

CurrentLatest

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 3 weeks ago
open-weightsmultilingual

xAI

Grok-2

CurrentLatest

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 5 weeks ago
frontierapi

Missing a frontier release? Add a model (editors)