
GPT-4o
Fast, intelligent, flexible GPT model
4.8•462 reviews•13K followers
Fast, intelligent, flexible GPT model
4.8•462 reviews•13K followers
GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
This is the 10th launch from GPT-4o. View more

OpenAI GPT-4o Audio Models
New OpenAI audio models for developers: gpt-4o powered speech-to-text (more accurate than Whisper) and steerable text-to-speech. Build voice agents, transcriptions, and more.





Launch Team




I like the sound. I listened to the article at 1.5 speed, sometimes it seemed like the pronunciation was slowing down, sometimes it was speeding up. I would like to see 1.25 playback speed in the future, but even so it is already quite pleasant!)
Currents AI
hjkgjkgjkg
tamer elkhayat
translate to Arabic
Incredible upgrade from OpenAI—GPT-4o’s boost in speech-to-text accuracy is a big leap forward, and steerable TTS opens up so many creative and practical use cases. Voice agents just got a serious level-up. Can’t wait to see what devs build with this!