VideoWorld

VideoWorld

Teaching AI to Learn by Watching

9 followers

VideoWorldis an autoregressive video generation model from ByteDance Seed team & universities, learning complex tasks (Go, robotics) from unlabeled videos. Uses a Latent Dynamics Model (LDM). Open-source.
VideoWorld gallery image
VideoWorld gallery image
VideoWorld gallery image
VideoWorld gallery image
Free
Launch Team
Intercom
Intercom
Startups get 90% off Intercom + 1 year of Fin AI Agent free
Promoted

What do you think? …

Zac Zuo

Hi everyone!

Humans learn to interact with the world by observing it, even before developing language. So why can't AI? 🤔

Check out VideoWorld, a groundbreaking research project from ByteDance Seed, in collaboration with several universities, that explores how AI can learn solely from watching videos – no text & labels!

Think of it like this: instead of teaching an AI with rules and explanations, you just show it videos, and it figures things out.

Key aspects:

👁️ Pure Visual Learning: Learns Go rules and robotic control without any language input.
🧠 Latent Dynamics Model (LDM): A novel technique that helps the model learn efficiently by focusing on changes in the video.
🏆 Impressive Results: Achieves a 5-dan professional level in Go (using Video-GoBench) and approaches oracle performance in robotics tasks.
🔓 Open-Source: Code, data, and models are available.

This model represents a significant step towards AI that learns more like humans do.