GenAIWiki

GPT-4o

LegacyFrontier

OpenAI’s flagship multimodal chat model for production assistants: native image and audio inputs, strong tool and JSON-mode behavior, and low-latency routing on the Chat Completions API.

Newer version: GPT-5.5

Provider

OpenAI

Model family

OpenAI GPT

Multimodal LLM

Cost tier

Flagship

Status

Legacy

Release May 13, 2024

Why teams choose it

🧠

Azure OpenAI and direct OpenAI APIs differ slightly in SKU names—pin deployment names in config.

Azure OpenAI and direct OpenAI APIs differ slightly in SKU names—pin deployment names in config.

📎

Multimodal limits (image count, resolution) change; validate against the current model c…

Multimodal limits (image count, resolution) change; validate against the current model card before UX sign-off.

Tradeoffs to know

  • Pricing and rate limits are tier-dependent—budget for burst traffic.
  • Policy and safety defaults differ between consumer ChatGPT and API products.

When not to use this

  • Not ideal for simple tasks where cheaper models in the same lineup are good enough.
  • Avoid for latency-sensitive real-time chat when raw response speed outweighs reasoning depth.
  • Confirm limits, pricing, and regional availability on the provider side before committing production workloads.

Technical specs

Inputs
text, image, audio
Outputs
text
Capabilities
tool use, vision, json mode, function calling, streaming
License
Proprietary API
Model string
gpt-4o

Benchmarks

{
  "mmlu": 88.7,
  "humaneval": 90.2
}

OpenAI GPT family lineup


Compare with