Zac Zuo

Gemma 3n - Run powerful multimodal AI right on your phone

Gemma 3n is Google's new open model, optimized for on-device multimodal AI. Its novel MatFormer architecture enables powerful yet efficient models (like the 2B/4B variants) that can run locally on phones and laptops. Supports image, audio & video.

Add a comment

Replies

Best
Zac Zuo

Hi everyone!

Google is back with a major release for their open model family: Gemma 3n. It’s a big step forward for powerful, on-device multimodal AI.

It has the new MatFormer architecture. It's like a Matryoshka doll, a single, larger model contains smaller, fully-functional models inside. This gives developers incredible flexibility. You can deploy a tiny 2B effective parameter model for speed, a more powerful 4B version, or even create custom sizes in between.

And it’s built to be very efficient. Techniques like Per-Layer Embeddings mean only a small part of the model needs to live in VRAM, making it truly viable for phones and laptops. It’s also fully multimodal, handling image, audio, and video inputs with strong performance.

Zovie Hartwell

I think Google.dev offers a great way to track and showcase your learning progress with Google technologies. It’s helpful for staying motivated through badges and clear pathways, though sometimes the variety of content can feel overwhelming.

Erliza. P

📱🧠 5th launch from Google.dev and now Gemma 3n brings multimodal AI to your pocket? That’s wild 🔥🤯

Thibault (aka TBot, but still human 🤪)
Dope ! hedge is the future 👽
Ilya Vorobiev

Gemma family keeps getting better 🔥 This is huge! Local LLMs are definitely the future - no more worrying about internet connection or privacy. Massive progress from the team, thank you! Can't wait to build with this 🚀