GPT-4o
LegacyFrontier
OpenAI’s flagship multimodal chat model for production assistants: native image and audio inputs, strong tool and JSON-mode behavior, and low-latency routing on the Chat Completions API.
Newer version: GPT-5.5
Provider
OpenAI
Model family
OpenAI GPT
Multimodal LLM
Cost tier
Flagship
Status
Legacy
Release May 13, 2024
Why teams choose it
🧠
Azure OpenAI and direct OpenAI APIs differ slightly in SKU names—pin deployment names in config.
Azure OpenAI and direct OpenAI APIs differ slightly in SKU names—pin deployment names in config.
📎
Multimodal limits (image count, resolution) change; validate against the current model c…
Multimodal limits (image count, resolution) change; validate against the current model card before UX sign-off.
Tradeoffs to know
- Pricing and rate limits are tier-dependent—budget for burst traffic.
- Policy and safety defaults differ between consumer ChatGPT and API products.
When not to use this
- Not ideal for simple tasks where cheaper models in the same lineup are good enough.
- Avoid for latency-sensitive real-time chat when raw response speed outweighs reasoning depth.
- Confirm limits, pricing, and regional availability on the provider side before committing production workloads.
Technical specs
- Inputs
- text, image, audio
- Outputs
- text
- Capabilities
- tool use, vision, json mode, function calling, streaming
- License
- Proprietary API
- Model string
gpt-4o
Benchmarks
{
"mmlu": 88.7,
"humaneval": 90.2
}OpenAI GPT family lineup
Current models