Wallie V2

The open-source AI streamer that actually feels alive

122 followers

The open-source AI streamer that actually feels alive

122 followers

Visit website

Live streaming platforms

•

AI Characters

Wallie is an open-source AI streamer that actually feels alive. It reacts to your screen, reads live chat on Twitch/YouTube/Kick, animates a Live2D avatar with real lipsync, and never repeats itself — all running locally on your machine. Swap LLM and TTS providers freely. Start free with Groq + Piper. Zero cloud lock-in.

Free

Launch tags:Open Source•Developer Tools•Artificial Intelligence

Launch Team / Built With

SeaTicketAl agent that resolves issues across all your channels

Promoted

Wallie V2

Maker

📌

Hey Product Hunt! 👋 I've been obsessed with AI streamers for a while — but every existing solution felt the same: robotic narration, endless question loops, zero personality. They didn't feel alive. So I built Wallie — an open-source AI streamer framework designed from the ground up to actually behave like a real person streaming. Not just "AI reads text out loud." Wallie: Develops thoughts over time instead of resetting every sentence Reacts to your screen — notices when you switch games, start typing, or go idle Reads live chat on Twitch, YouTube, and Kick — and responds in character Remembers topics across a session (and across sessions) Animates a Live2D avatar with real lipsync, mood-reactive expressions, and natural idle behavior Never repeats itself — a deduplication engine catches paraphrased repetition within the same stream And it runs on your machine. Your API keys stay local. You can even run it fully offline with Ollama + Piper — zero cloud, zero cost. I've been building this for the past few months. It's fully open-source, MIT licensed, and genuinely free to start with Groq's free tier. Would love to hear what you think — especially from VTubers and streamers who've tried other AI tools before. What made you give up on them? That's exactly what Wallie is designed to fix.

Report

22d ago

the reacts to your screen feature is the interesting differentiator here. most AI streamers just respond to chat which is a solved problem. an avatar that can comment on what's actually happening in the game or on screen is a different kind of presence. curious how the screen reading works, is it vision model calls on a frame interval or something else, and what the latency looks like between something happening on screen and Wallie actually reacting to it

Report

17d ago

Wallie V2

Maker

@ansari_adin Vision runs on a frame interval, yeah. mss captures the screen, perceptual hash (pHash) detects meaningful changes, and if the delta clears the threshold, it fires a vision model call with the current frame. The interval and sensitivity are configurable from the dashboard.

Latency from screen event to spoken reaction: typically 2–4 seconds end to end, depending on the LLM provider. Groq + Llama-4 Scout gets you the fastest loop (~1.5–2s). Claude Sonnet is slower on raw latency but produces better reactions — especially for things like recognizing game UI, character names, or anything that requires IP/context knowledge.

The attention engine also means not every screen change triggers a full reaction. The model probabilistically assigns DEEP (22%), GLANCE (28%), TANGENT (5%), IGNORE (27%), or SILENCE (18%) — so Wallie doesn't spam reactions to every mouse move, which makes the ones that do happen feel more considered. Streak fatigue prevents the same reaction type from firing back-to-back.

Report

17d ago

Super work! is it possibly to integrate it with Gemini Live Model?

Report

17d ago

Wallie V2

Maker

@ashishkingdom Gemini is already supported as an LLM provider (Gemini 2.5 Flash and Pro, streaming + vision). You set it from the dashboard: Engine → provider: gemini, then pick your model.

Gemini Live specifically (the real-time audio/multimodal API) isn't integrated yet — that's a different API surface from the standard completions endpoint Wallie uses. It's on the roadmap conceptually (the "Hearing" item — real-time audio input), but the current TTS pipeline and single-history orchestrator design would need some rethinking to accommodate it cleanly. If you're interested in contributing, that'd be a solid PR to open.

Report

17d ago

Forum Threads

p/wallie-v2

•

5d ago

Wallie can now PLAY Minecraft — not just watch your screen 🎮

When we launched, Wallie was an open-source AI streamer that watches your screen, hears your audio, and reacts in character.

Now it does something I'm genuinely excited about: it PLAYS.

Wallie plays Minecraft survival live and unscripted it mines, hunts for iron, crafts, fights, and tries to survive the night, all on its own. No human at the controls, no script. And it talks the whole way through, in character, reacting to what's actually happening.

Two things make it special:

p/wallie-v2

•

16d ago

Watch Wallie react to Minecraft in real-time 🎙️ (open-source AI streamer)

Quick demo of Wallie actually doing its thing

It watches the screen and reacts live like a streamer gets jumped by a

skeleton, dreads creepers in the dark, swears it's "close to diamonds."

First-person personality, real-time voice, fully local.

p/wallie-v2

•

12d ago

Wallie can now HEAR — plus it's now one-click to try

1) Wallie can now HEAR.

It already saw your screen and reacted. Now it hears everything on your system music, videos, voices and reacts live, fused with what it sees. Two senses, one reaction.

And it actually understands music: mood, tempo, major/minor key, even the lyrics. Play a sad song and it feels the melancholy. A beat drops and it reacts to the energy. It'll call a track a banger or roast a muddy mix.

2) It's now way easier to try.

View all