AI Models Directory | GenAIWiki

GenAIWikiAI Knowledge. Simplified.

Search About Sign in

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

OpenAI

GPT-5.5

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated today

frontierreasoning

Google

Gemini 2.5 Pro

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated today

Anthropic

Claude Opus 4.7

Anthropic's most capable generally available Claude model for complex reasoning and agentic coding, documented in the Claude model overview.

FeaturedUpdated today

frontierreasoning

xAI

Grok 4.3

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated today

Mistral AI

Mistral Medium 3.5

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated today

frontiermistral

DeepSeek

DeepSeek-V3.2

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated today

frontierdeepseek

Meta

Llama 3.1 405B Instruct

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 5 weeks ago

open-weightsself-host

Cohere

Command R+

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 5 weeks ago

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Mistral AI

Mixtral 8x22B

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

Community

LLaVA

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

xAI

Grok 1.5

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

OpenAI

GPT-5.4 nano

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

OpenAI

GPT-5.4 mini

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

DeepSeek

DeepSeek Coder V2

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

Databricks

DBRX

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

Anthropic

Claude Haiku 4.5

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 5 weeks ago

OpenAI

o1

OpenAI’s o1 series emphasizes extended internal reasoning before answering—useful for competition-style math, complex debugging, and multi-step planning where latency is acceptable. It behaves differently from standard chat models: tune prompts for chain-of-thought style tasks and measure time-to-first-token.

FeaturedUpdated 5 weeks ago

AWS

Amazon Nova

Amazon Nova is AWS’s multimodal foundation model family for text, image, and video workloads delivered through Amazon Bedrock with enterprise IAM, VPC, and governance patterns. Model IDs and modalities vary by region—verify Bedrock model access lists.

FeaturedUpdated 5 weeks ago

OpenAI

o3-mini

Compact reasoning-focused model in OpenAI’s o-series line aimed at strong STEM and coding performance with lower cost than full o3. Intended for developers who want reasoning without always paying flagship prices—confirm exact API availability and snapshot names in OpenAI docs.

Updated 3 weeks ago

OpenAI

Whisper large-v3

Whisper large-v3 is OpenAI’s ASR model for transcription and translation across many languages, with strong robustness to accents and noise. It is commonly self-hosted or used via API partners; latency depends heavily on hardware and chunking strategy.

Updated 5 weeks ago

audioopen-weights

OpenAI

text-embedding-3-large

text-embedding-3-large produces high-dimensional text embeddings for semantic search, clustering, and classification. Teams pair it with pgvector or SaaS vector DBs for RAG; output dimensions can be reduced with tradeoffs described in OpenAI documentation.

Updated 5 weeks ago

embeddingsretrieval

Stability AI

Stable Diffusion XL

SDXL is a latent diffusion backbone for high-resolution image generation with broad community tooling (LoRA, ControlNet) and OpenRAIL-style licensing. It is typically self-hosted or run via GPU marketplaces rather than a single proprietary chat API.

Updated 5 weeks ago

imageopen-weights

Snowflake

Snowflake Arctic

Snowflake Arctic is an enterprise-oriented open model emphasizing efficient training recipes and SQL-adjacent enterprise tasks inside the Snowflake ecosystem. It targets teams that want LLM features colocated with governed data in Snowflake Cortex.

Updated 5 weeks ago

OpenAI

o1-mini

A smaller, faster o1-class model for STEM-style reasoning where full o1 latency or cost is prohibitive. Use it when you need better chain-of-thought style behavior than GPT-4o mini but not full o1 depth.

Updated 5 weeks ago

NVIDIA

NVIDIA Nemotron-4 340B

NVIDIA Nemotron-4 340B is a large open-weights model suite aimed at enterprise and research users who train and serve on NVIDIA stacks (NeMo, NGC). It targets GPU-native teams that need customizable checkpoints with NVIDIA-optimized tooling.

Updated 5 weeks ago

open-weightsenterprise

Google

Gemma 2 27B

Gemma 2 27B is Google’s open-weights Gemma family checkpoint balancing quality and deployability for research and product teams that need permissive terms without Vertex-only APIs. It is often fine-tuned for domain tasks on TPU or GPU clusters.

Updated 5 weeks ago

open-weightsgoogle

OpenAI

DALL·E 3

DALL·E 3 is OpenAI’s instruction-aligned image generation model exposed via the Images API, emphasizing prompt adherence and safety classifiers for consumer and enterprise creative workflows. It targets marketing visuals, product mockups, and storyboarding rather than photorealistic deception.

Updated 5 weeks ago

AWS

Amazon Titan Text Premier

Titan Text Premier is AWS’s managed text model for Bedrock workloads emphasizing integration with guardrails, knowledge bases, and private data patterns. It targets enterprise RAG and internal assistants rather than frontier creative writing.

Updated 5 weeks ago

OpenAI

GPT-5.4

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Anthropic

Claude Sonnet 4.6

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated today

Anthropic

Claude 3.5 Sonnet

Anthropic’s balanced Sonnet-tier model tuned for long-context reasoning, careful instruction following, and strong performance on coding and analysis workloads. It is a common enterprise choice on the Anthropic API and on AWS Bedrock when teams need large context for RAG and document review.

FeaturedUpdated 3 weeks ago

DeepSeek

DeepSeek-R1

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 3 weeks ago

reasoningresearch

Alibaba

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 3 weeks ago

open-weightsmultilingual

xAI

Grok-2

Grok-2 is xAI’s flagship chat model positioned for real-time knowledge integrations and high-throughput conversational products on xAI’s API. Availability and pricing evolve—treat capabilities as vendor-specific.

FeaturedUpdated 5 weeks ago

Missing a frontier release? Add a model (editors)