AI Models Directory | GenAIWiki

GenAIWikiAI Knowledge. Simplified.

Search About Sign in

Structured cards

Model database

Filter by provider, architecture family, or full-text search across descriptions.

Frontier models

Verified flagship models across major providers, curated for operational relevance.

Anthropic

Claude Fable 5

Anthropic's highest-capability widely released Claude model, documented for deep reasoning, codebase-scale work, long-context enterprise workloads, and multimodal inputs.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.5

OpenAI's current flagship model for complex reasoning, coding, and professional work, documented in the OpenAI API model guide as the default starting point for high-complexity workloads.

FeaturedUpdated 3 weeks ago

frontierreasoning

Microsoft AI

MAI-Thinking-1

Microsoft AI's frontier reasoning model in the MAI family, announced for difficult prompts, science, math, and complex planning workloads, with Microsoft Foundry access documented as private preview.

FeaturedUpdated 3 weeks ago

frontiermicrosoft

Google

Gemini 2.5 Pro

Google's advanced Gemini model for complex tasks, with official Gemini API documentation calling out deep reasoning and coding capabilities.

FeaturedUpdated 3 weeks ago

Anysphere

Cursor Composer 2.5

Cursor's Composer 2.5 model, documented in the official Cursor changelog as a coding model used in the Auto routing flow.

FeaturedUpdated 3 weeks ago

xAI

Grok 4.3

xAI's documented default for general chat workloads, described in xAI docs as the most intelligent and fastest Grok model for non-specialized use cases.

FeaturedUpdated 4 weeks ago

Mistral AI

Mistral Medium 3.5

Mistral's documented latest frontier-class multimodal model optimized for agentic and coding use cases.

FeaturedUpdated 4 weeks ago

frontiermistral

DeepSeek

DeepSeek-V3.2

DeepSeek's documented successor to the V3.2 experimental line, positioned in official DeepSeek API news as live on app, web, and API.

FeaturedUpdated 4 weeks ago

frontierdeepseek

All models

Filter and paginate the full catalog. Tabs control lifecycle scope.

Alibaba

Qwen 2

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Microsoft

Phi-3 Mini

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Meta

Llama 3 8B

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Meta

Llama 3 70B

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

OpenAI

GPT-4.1 nano

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

TII

Falcon 180B

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Anthropic

Claude Opus 4.5

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Anthropic

Claude Opus 4.6

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

Anthropic

Claude Sonnet 4.5

Catalog entry for this named release; see the provider’s official documentation for modalities, pricing, and context limits.

Updated 4 weeks ago

OpenAI

o1-mini

A smaller, faster o1-class model for STEM-style reasoning where full o1 latency or cost is prohibitive. Use it when you need better chain-of-thought style behavior than GPT-4o mini but not full o1 depth.

Updated 4 weeks ago

OpenAI

DALL·E 3

DALL·E 3 is OpenAI’s instruction-aligned image generation model exposed via the Images API, emphasizing prompt adherence and safety classifiers for consumer and enterprise creative workflows. It targets marketing visuals, product mockups, and storyboarding rather than photorealistic deception.

Updated 4 weeks ago

Recommended

Top current models by information quality score — good defaults when you are not sure where to start.

Sarvam AI

Sarvam 30B

Sarvam 30B is a 30B parameter Mixture-of-Experts chat and reasoning model from Sarvam AI, optimized for Indian languages, real-time conversation, high-throughput voice-agent pipelines, coding, and practical deployment. Sarvam documents 2.4B active parameters per token, 16T tokens of pre-training data, a 64K context window, Grouped Query Attention, Apache 2.0 open weights, and OpenAI-compatible chat completions.

Updated 10 days ago

sarvamindian languages

Sarvam AI

Sarvam 105B

Sarvam 105B is Sarvam AI's flagship 105B+ parameter Mixture-of-Experts reasoning model for Indian-language and English chat, complex reasoning, coding, long-context document analysis, and agentic tool-use workflows. Sarvam documents it as a 128K-context OpenAI-compatible chat model with Multi-head Latent Attention, 12T tokens of pre-training data, Apache 2.0 open weights, and production use powering Indus. Its strongest fit is Indian-language enterprise assistants, multilingual reasoning, and agent workflows where native script, romanized, and code-mixed inputs matter.

Updated 10 days ago

sarvamindian languages

Anthropic

Claude Opus 4.8

Anthropic's current Opus-tier Claude model, documented for complex reasoning, coding, and multimodal enterprise workloads below the newer Fable tier.

FeaturedUpdated 3 weeks ago

OpenAI

GPT-5.4

OpenAI's GPT-5.4 model, documented in the official OpenAI API model guide as part of the current GPT-5 family below the GPT-5.5 flagship lane.

FeaturedUpdated 3 weeks ago

Stability AI

Stable Diffusion XL

Stable Diffusion XL (SDXL) 1.0 is Stability AI's latent diffusion text-to-image model for native 1024x1024 generation. The base model can run standalone or feed an optional refiner for the final denoising steps, and the published weights support self-hosted Diffusers workflows.

Updated 4 weeks ago

imageopen-weights

Alibaba

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct is a large multilingual open-weights model from Alibaba’s Qwen family with strong coding and general chat performance. Common in APAC deployments and on Hugging Face inference endpoints—check license terms for commercial use.

FeaturedUpdated 4 weeks ago

open-weightsmultilingual

Missing a frontier release? Add a model (editors)