p/self-promotion by Vladislava Karim • 2mo ago
We challenged xAI's Grok to a public AI benchmark battle
... Last week something unexpected happened. We publicly challenged Grok, xAI's AI, to a benchmark competition on Twitter. And Grok accepted.

The rules were simple:
Same public datasets
Our engine: zero-shot (no training data)
Grok: supervised ML with full cross-validation

The datasets:
CWRU industrial bearing fault detection
UCI HAR human activity recognition
ALFA UAV drone motor fault detection (robotics)

What happened: Grok ran supervised baselines. We ran zero-shot.

Results Grok confirmed publicly:
CWRU: 92.2% accuracy, 98.3% recall ...