Pippo

Pippo

High-Res 3D Humans from a Single Photo.

8 followers

Pippo, from Meta Reality Labs: Generate high-res (1K) multi-view human videos from a single photo! Uses a Diffusion Transformer, ControlMLP, and Attention Biasing. Code available.
Pippo gallery image
Pippo gallery image
Pippo gallery image
Pippo gallery image
Launch Team
Anima - Vibe Coding for Product Teams
Build websites and apps with AI that understands design.
Promoted

What do you think? …

Zac Zuo

Hi everyone!

Sharing Pippo, a new project from Meta Reality Labs that takes single-image human generation to the next level!

📸 One Photo, Full Turnaround: Create a complete 3D turnaround video from a single full-body or face-only photo.
🎥 Multi-View from Video: Generate multi-view videos from monocular (single-camera) video input.
✨ 1K Resolution: The output is high-resolution (1024x1024).
⚡ One Forward Pass: The entire video is generated in a single forward pass of the model – no iterative refinement needed.
⚙️ Advanced Tech: It's based on a Diffusion Transformer, with techniques like ControlMLP (for pixel-aligned control) and Attention Biasing (for longer videos).

While this is currently a code-only release (no pre-trained weights yet), the potential is huge for creating realistic avatars, enhancing video editing workflows, and more.