trending
Ankit Sharma

22d ago

Vision Banana From Google DeepMind - Image Generators are Generalist Vision Learners

Unified model that outperforms SoTA specialist models on various vision tasks! By treating 2D/3D vision tasks as image generation, we unlock a new foundation for CV.