Structured cards
Model database
Filter by provider, architecture family, or full-text search across descriptions.
Frontier models
Verified flagship models across major providers, curated for operational relevance.
OpenAI
GPT-5.5
FrontierLatestOpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.
Gemini 2.5 Pro
FrontierLatestGoogle's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.
Anthropic
Claude Opus 4.7
FrontierLatestAnthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.
xAI
Grok 4.3
FrontierLatestxAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.
Mistral AI
Mistral Medium 3.5
FrontierLatestMistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.
DeepSeek
DeepSeek-V3.2
FrontierLatestDeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.
Meta
Llama 3.1 405B Instruct
FrontierLatestMeta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.
Cohere
Command R+
FrontierLatestCohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.
All models
Filter and paginate the full catalog. Tabs control lifecycle scope.
Mistral AI
Mixtral 8x22B
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Community
LLaVA
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
xAI
Grok 1.5
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
OpenAI
GPT-5.4 nano
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
OpenAI
GPT-5.4 mini
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
DeepSeek
DeepSeek Coder V2
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Databricks
DBRX
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Anthropic
Claude Haiku 4.5
CurrentLatestCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
OpenAI
o1
CurrentOpenAI’s o1 series emphasizes extended internal reasoning before answering—useful for competition-style math, complex debugging, and multi-step planning where latency is acceptable. It behaves differently from standard chat models: tune prompts for chain-of-thought style tasks and measure time-to-first-token.
AWS
Amazon Nova
CurrentAmazon Nova is AWS’s multimodal foundation model family for text, image, and video workloads delivered through Amazon Bedrock with enterprise IAM, VPC, and governance patterns. Model IDs and modalities vary by region—verify Bedrock model access lists.
OpenAI
o3-mini
CurrentCompact reasoning-focused model in OpenAI’s o-series line aimed at strong STEM and coding performance with lower cost than full o3. Intended for developers who want reasoning without always paying flagship prices—confirm exact API availability and snapshot names in OpenAI docs.
OpenAI
Whisper large-v3
CurrentWhisper large-v3 is OpenAI’s ASR model for transcription and translation across many languages, with strong robustness to accents and noise. It is commonly self-hosted or used via API partners; latency depends heavily on hardware and chunking strategy.
OpenAI
text-embedding-3-large
Currenttext-embedding-3-large produces high-dimensional text embeddings for semantic search, clustering, and classification. Teams pair it with pgvector or SaaS vector DBs for RAG; output dimensions can be reduced with tradeoffs described in OpenAI documentation.
Stability AI
Stable Diffusion XL
CurrentSDXL is a latent diffusion backbone for high-resolution image generation with broad community tooling (LoRA, ControlNet) and OpenRAIL-style licensing. It is typically self-hosted or run via GPU marketplaces rather than a single proprietary chat API.
Snowflake
Snowflake Arctic
CurrentSnowflake Arctic is an enterprise-oriented open model emphasizing efficient training recipes and SQL-adjacent enterprise tasks inside the Snowflake ecosystem. It targets teams that want LLM features colocated with governed data in Snowflake Cortex.
OpenAI
o1-mini
CurrentA smaller, faster o1-class model for STEM-style reasoning where full o1 latency or cost is prohibitive. Use it when you need better chain-of-thought style behavior than GPT-4o mini but not full o1 depth.
NVIDIA
NVIDIA Nemotron-4 340B
CurrentNVIDIA Nemotron-4 340B is a large open-weights model suite aimed at enterprise and research users who train and serve on NVIDIA stacks (NeMo, NGC). It targets GPU-native teams that need customizable checkpoints with NVIDIA-optimized tooling.
Gemma 2 27B
CurrentGemma 2 27B is Google’s open-weights Gemma family checkpoint balancing quality and deployability for research and product teams that need permissive terms without Vertex-only APIs. It is often fine-tuned for domain tasks on TPU or GPU clusters.
OpenAI
DALL·E 3
CurrentDALL·E 3 is OpenAI’s instruction-aligned image generation model exposed via the Images API, emphasizing prompt adherence and safety classifiers for consumer and enterprise creative workflows. It targets marketing visuals, product mockups, and storyboarding rather than photorealistic deception.
AWS
Amazon Titan Text Premier
CurrentTitan Text Premier is AWS’s managed text model for Bedrock workloads emphasizing integration with guardrails, knowledge bases, and private data patterns. It targets enterprise RAG and internal assistants rather than frontier creative writing.
OpenAI
GPT-5.4
CurrentCatalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.
Recommended
Top current models by information quality score — good defaults when you are not sure where to start.
Anthropic
Claude Sonnet 4.6
CurrentLatestAnthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.
Anthropic
Claude 3.5 Sonnet
CurrentLatestAnthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.
DeepSeek
DeepSeek-R1
CurrentLatestDeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.
Alibaba
Qwen 2.5 72B Instruct
CurrentLatestQwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.
xAI
Grok-2
CurrentLatestGrok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.
Missing a frontier release? Add a model (editors)