OpenAI
Whisper large-v3
Speech-to-text · Release Nov 6, 2023 · MIT
Robust ASR model for transcription and translation with strong performance across accents and noisy environments.
Modalities
What goes in and what comes out.
Inputs
audio
Outputs
text
Capabilities
transcription, translation, timestamps
Benchmarks snapshot
Structured JSON for reproducible comparisons.
{
"wer": "low on common benchmarks"
}Related on GenAIWiki
Same provider, tooling that cites the model, or prompts tuned for it.
OpenAI
GPT-4o
Flagship multimodal model tuned for tool use, vision understanding, and low-latency chat experiences across consumer and enterprise surfaces.
OpenAI
GPT-4 Turbo
GPT-4 Turbo is optimized for speed and efficiency, providing rapid text generation with a 16k token context window. It is designed for applications requiring fast responses without sacrificing quality.
OpenAI
text-embedding-3-large
High-dimensional embedding model designed for semantic search, clustering, and retrieval with adjustable output size.
OpenAI
DALL·E 3
Instruction-following image generation model integrated with safety classifiers and chat-native prompting flows.
ML platform
Hugging Face
Hub for open models, datasets, and Spaces demos, plus Inference Endpoints, Transformers, and enterprise features for teams that train, fine-tune, or serve open-weight and partner models at scale.
Model hosting
Replicate
Managed inference platform for running open and custom models through simple APIs, with usage-based billing and strong support for image, video, and multimodal workloads.