GPT-4o

GPT-4o

Fast, intelligent, flexible GPT model

4.8
462 reviews

13K followers

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
This is the 10th launch from GPT-4o. View more
OpenAI GPT-4o Audio Models

OpenAI GPT-4o Audio Models

Build Powerful Voice Agents
New OpenAI audio models for developers: gpt-4o powered speech-to-text (more accurate than Whisper) and steerable text-to-speech. Build voice agents, transcriptions, and more.
OpenAI GPT-4o Audio Models gallery image
OpenAI GPT-4o Audio Models gallery image
OpenAI GPT-4o Audio Models gallery image
OpenAI GPT-4o Audio Models gallery image
OpenAI GPT-4o Audio Models gallery image
Launch Team
Vy - Cross platform AI agent
Vy - Cross platform AI agent
AI agent that uses your computer, cross platform, no APIs
Promoted

What do you think? …

Andrew Wipkeo

I like the sound. I listened to the article at 1.5 speed, sometimes it seemed like the pronunciation was slowing down, sometimes it was speeding up. I would like to see 1.25 playback speed in the future, but even so it is already quite pleasant!)

Jennie Weng
Is there any other products that outperform openAI’s? I.e. does Elevenlab do a greater job?
KDDP-Desouk

hjkgjkgjkg

KDDP-Desouk

tamer elkhayat
translate to Arabic

Islam Akramov

Incredible upgrade from OpenAI—GPT-4o’s boost in speech-to-text accuracy is a big leap forward, and steerable TTS opens up so many creative and practical use cases. Voice agents just got a serious level-up. Can’t wait to see what devs build with this!