AI Models Directory | GenAIWiki

GenAIWikiAI Knowledge. Simplified.

Search About Sign in

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

Anthropic

Claude Fable 5

Anthropic's highest-capability widely released Claude model, documented for deep reasoning, codebase-scale work, long-context enterprise workloads, and multimodal inputs.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.5

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated 3 weeks ago

frontierreasoning

Microsoft AI

MAI-Thinking-1

Microsoft AI's frontier reasoning model in the MAI family, announced for difficult prompts, science, math, and complex planning workloads, with Microsoft Foundry access documented as private preview.

FeaturedUpdated 3 weeks ago

frontiermicrosoft

Google

Gemini 2.5 Pro

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated 3 weeks ago

Anysphere

Cursor Composer 2.5

Cursor's Composer 2.5 model, documented in the official Cursor changelog as a coding model used in the Auto routing flow.

FeaturedUpdated 3 weeks ago

xAI

Grok 4.3

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated 4 weeks ago

Mistral AI

Mistral Medium 3.5

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated 4 weeks ago

frontiermistral

DeepSeek

DeepSeek-V3.2

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated 4 weeks ago

frontierdeepseek

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Sarvam AI

Sarvam 30B

Sarvam 30B is a 30B parameter Mixture-of-Experts chat and reasoning model from Sarvam AI, optimized for Indian languages, real-time conversation, high-throughput voice-agent pipelines, coding, and practical deployment. Sarvam documents 2.4B active parameters per token, 16T tokens of pre-training data, a 64K context window, Grouped Query Attention, Apache 2.0 open weights, and OpenAI-compatible chat completions.

Updated 10 days ago

sarvamindian languages

Sarvam AI

Sarvam 105B

Sarvam 105B is Sarvam AI's flagship 105B+ parameter Mixture-of-Experts reasoning model for Indian-language and English chat, complex reasoning, coding, long-context document analysis, and agentic tool-use workflows. Sarvam documents it as a 128K-context OpenAI-compatible chat model with Multi-head Latent Attention, 12T tokens of pre-training data, Apache 2.0 open weights, and production use powering Indus. Its strongest fit is Indian-language enterprise assistants, multilingual reasoning, and agent workflows where native script, romanized, and code-mixed inputs matter.

Updated 10 days ago

sarvamindian languages

Anthropic

Claude Fable 5

Anthropic's highest-capability widely released Claude model, documented for deep reasoning, codebase-scale work, long-context enterprise workloads, and multimodal inputs.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.5

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated 3 weeks ago

frontierreasoning

Google

Gemini 2.5 Pro

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated 3 weeks ago

Anthropic

Claude Opus 4.8

Anthropic's current Opus-tier Claude model, documented for complex reasoning, coding, and multimodal enterprise workloads below the newer Fable tier.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.4

OpenAI's GPT-5.4 model, documented in the official OpenAI API model guide as part of the current GPT-5 family below the GPT-5.5 flagship lane.

FeaturedUpdated 3 weeks ago

Stability AI

Stable Diffusion XL

Stable Diffusion XL (SDXL) 1.0 is Stability AI's latent diffusion text-to-image model for native 1024x1024 generation. The base model can run standalone or feed an optional refiner for the final denoising steps, and the published weights support self-hosted Diffusers workflows.

Updated 4 weeks ago

imageopen-weights

Alibaba

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 4 weeks ago

open-weightsmultilingual

Meta

Llama 3.1 405B Instruct

Meta’s largest open-weights instruct checkpoint in the Llama 3.1 family, aimed at strong reasoning and coding quality with a permissive license for research and customization. It is typically served on dedicated GPU clusters or via partners (cloud inference, on-prem) rather than a single vendor API.

FeaturedUpdated 4 weeks ago

open-weightsself-host

DeepSeek

DeepSeek-R1

DeepSeek-R1 is a reasoning-focused model family emphasizing chain-of-thought style behavior for math, code, and structured problem solving. Deployment options include API and open-weight variants—verify licensing and hosting constraints for your region.

FeaturedUpdated 4 weeks ago

reasoningresearch

Cohere

Command R+

Cohere’s enterprise-oriented Command R+ emphasizes retrieval-grounded answers and tool orchestration patterns for business data. It targets teams building RAG-heavy assistants where citation-style behavior and connector patterns matter more than raw chat novelty.

FeaturedUpdated 4 weeks ago

Anthropic

Claude Sonnet 4.6

Anthropic's Sonnet-tier model documented as the best combination of speed and intelligence in the Claude model overview.

FeaturedUpdated 4 weeks ago

AWS

Amazon Nova

Amazon Nova is AWS’s multimodal foundation model family for text, image, and video workloads delivered through Amazon Bedrock with enterprise IAM, VPC, and governance patterns. Model IDs and modalities vary by region—verify Bedrock model access lists.

FeaturedUpdated 4 weeks ago

Anysphere

Cursor Composer 2.5

Cursor's Composer 2.5 model, documented in the official Cursor changelog as a coding model used in the Auto routing flow.

FeaturedUpdated 3 weeks ago

Microsoft AI

MAI-Code-1-Flash

Microsoft AI's agentic coding model in the MAI family, announced for fast code editing, debugging, and tool-driven developer workflows.

FeaturedUpdated 3 weeks ago

codingmicrosoft

Mistral AI

Mistral Medium 3.5

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated 4 weeks ago

frontiermistral

xAI

Grok 4.3

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated 4 weeks ago

Google

Gemini 2.5 Flash-Lite

Google's fastest and most budget-friendly multimodal model in the Gemini 2.5 family, according to the Gemini API model documentation.

Updated 3 weeks ago

googleflash-lite

Microsoft AI

MAI-Image-2.5

Microsoft AI's MAI image model for generation, editing, and visual content workflows, announced as part of the June 2026 MAI model release.

Updated 3 weeks ago

Microsoft AI

MAI-Voice-2

Microsoft AI's voice generation model in the MAI family, announced for natural text-to-speech and voice experiences.

Updated 3 weeks ago

Microsoft AI

MAI-Transcribe-1.5

Microsoft AI's speech-to-text model in the MAI family, announced for fast, accurate transcription across product surfaces.

Updated 3 weeks ago

speechtranscription

OpenAI

GPT-5.4 mini

OpenAI's smaller GPT-5.4 mini model, documented in the official OpenAI API model guide for lower-latency or lower-cost GPT-5 family workloads.

Updated 3 weeks ago

Mistral AI

Mistral Large 3

Mistral's open-weight general-purpose multimodal model listed in official Mistral model documentation.

Updated 4 weeks ago

frontiermistral

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Sarvam AI

Sarvam 30B

Sarvam 30B is a 30B parameter Mixture-of-Experts chat and reasoning model from Sarvam AI, optimized for Indian languages, real-time conversation, high-throughput voice-agent pipelines, coding, and practical deployment. Sarvam documents 2.4B active parameters per token, 16T tokens of pre-training data, a 64K context window, Grouped Query Attention, Apache 2.0 open weights, and OpenAI-compatible chat completions.

Updated 10 days ago

sarvamindian languages

Sarvam AI

Sarvam 105B

Sarvam 105B is Sarvam AI's flagship 105B+ parameter Mixture-of-Experts reasoning model for Indian-language and English chat, complex reasoning, coding, long-context document analysis, and agentic tool-use workflows. Sarvam documents it as a 128K-context OpenAI-compatible chat model with Multi-head Latent Attention, 12T tokens of pre-training data, Apache 2.0 open weights, and production use powering Indus. Its strongest fit is Indian-language enterprise assistants, multilingual reasoning, and agent workflows where native script, romanized, and code-mixed inputs matter.

Updated 10 days ago

sarvamindian languages

Anthropic

Claude Opus 4.8

Anthropic's current Opus-tier Claude model, documented for complex reasoning, coding, and multimodal enterprise workloads below the newer Fable tier.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.4

OpenAI's GPT-5.4 model, documented in the official OpenAI API model guide as part of the current GPT-5 family below the GPT-5.5 flagship lane.

FeaturedUpdated 3 weeks ago

Stability AI

Stable Diffusion XL

Stable Diffusion XL (SDXL) 1.0 is Stability AI's latent diffusion text-to-image model for native 1024x1024 generation. The base model can run standalone or feed an optional refiner for the final denoising steps, and the published weights support self-hosted Diffusers workflows.

Updated 4 weeks ago

imageopen-weights

Alibaba

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 4 weeks ago

open-weightsmultilingual

Missing a frontier release? Add a model (editors)