Best Products
Launches
Launch archive
Most-loved launches by the community
Launch Guide
Checklists and pro tips for launching
News
Newsletter
The best of Product Hunt, every day
Stories
Tech news, interviews, and tips from makers
Changelog
New Product Hunt features and releases
Forums
Forums
Ask questions, find support, and connect
Streaks
The most active community members
Events
Meet others online and in-person
Advertise
Subscribe
Sign in
Clear text
recent
p/gpt-5
by
Aaron O'Leary
Featured
•
6mo ago
GPT-5: Not the AGI Messiah, but still pretty impressive
... like button. Benchmarks, benchmarks, benchmarks Benchmarks should always be taken with a grain of salt . They are effectively a snapshot of a models capabilities under near perfect conditions. Sort of like the Big Mac you see on the ads
vs
the Big Mac you get in the bag. It gives you a good idea, but they're far from a perfect measure of real world usage. Math (AIME 2025, no tools): 94.6 percent Real-world coding (SWE-bench Verified ... ... percent Multilingual programming (
Aider
9
56
Subscribe
Sign in