Gemma 3n - Run powerful multimodal AI right on your phone

Flowtica Scribe

•1yr ago

Gemma 3n is Google's new open model, optimized for on-device multimodal AI. Its novel MatFormer architecture enables powerful yet efficient models (like the 2B/4B variants) that can run locally on phones and laptops. Supports image, audio & video.

Replies

Best

Flowtica Scribe

Hunter

📌

Hi everyone!

Google is back with a major release for their open model family: Gemma 3n. It’s a big step forward for powerful, on-device multimodal AI.

It has the new MatFormer architecture. It's like a Matryoshka doll, a single, larger model contains smaller, fully-functional models inside. This gives developers incredible flexibility. You can deploy a tiny 2B effective parameter model for speed, a more powerful 4B version, or even create custom sizes in between.

And it’s built to be very efficient. Techniques like Per-Layer Embeddings mean only a small part of the model needs to live in VRAM, making it truly viable for phones and laptops. It’s also fully multimodal, handling image, audio, and video inputs with strong performance.

Report

1yr ago

I think Google.dev offers a great way to track and showcase your learning progress with Google technologies. It’s helpful for staying motivated through badges and clear pathways, though sometimes the variety of content can feel overwhelming.

Report

1yr ago

📱🧠 5th launch from Google.dev and now Gemma 3n brings multimodal AI to your pocket? That’s wild 🔥🤯

Report

1yr ago

Dope ! hedge is the future 👽

Report

1yr ago

Gemma family keeps getting better 🔥 This is huge! Local LLMs are definitely the future - no more worrying about internet connection or privacy. Massive progress from the team, thank you! Can't wait to build with this 🚀

Report

1yr ago