
Vision Banana From Google DeepMind
Image Generators are Generalist Vision Learners
4 followers
Image Generators are Generalist Vision Learners
4 followers
Unified model that outperforms SoTA specialist models on various vision tasks! By treating 2D/3D vision tasks as image generation, we unlock a new foundation for CV.
