Watch Wallie react to Minecraft in real-time ποΈ (open-source AI streamer)
byβ’
Quick demo of Wallie actually doing its thing π
It watches the screen and reacts live like a streamer β gets jumped by a
skeleton, dreads creepers in the dark, swears it's "close to diamonds."
First-person personality, real-time voice, fully local.
This is the v1.0 build β snappier reactions after the latest perf update.
Open source, run it yourself with Groq + Piper (free):
π https://github.com/Alradyin/wallie-V2
Watch Wallie react to Minecraft in real-time Curious what you'd want it to play next π
109 views


Replies
Fun concept . Real time personality reacting to gameplay makes AI streamers feel much more alive . Curious how you're handling latency between screen detection and voice response in real time play ?
Wallie V2
@lucas_cook2Β Thanks! That "feels alive" part is exactly what I was chasing
On latency, a few things stack up: β’ I don't run the LLM on every frame. A lightweight pHash change-detector + an "attention" engine decide when something's actually worth reacting to, so the heavy model only fires on real events (scene changes, getting jumped, entering a cave). That kills most of the wasted latency right there. β’ Everything streams. LLM tokens flow into a sentence splitter, and the FIRST sentence is sent to TTS before the full reply is even done β so audio starts ASAP instead of waiting on the whole response. β’ Sentences are pipelined: while sentence 1 is playing, sentence 2's TTS is already being synthesized in parallel. β’ TTS is tuned for first-audio (smaller server-side chunking), and frame prep (JPEG resize) runs off the event loop so it never stalls playback. Honest take: it's reaction-latency (~1-2s eventβvoice), not twitch-reflex β works great as a co-host reacting to what happens, not for frame-perfect input. It's all open source if you want to dig into the pipeline: https://github.com/Alradyin/wallie-V2
Cool demo the real time reactions make it feel surprisingly natural . I am thinking that how stable it runs during longer gameplay sessions?
Wallie V2
@madison_brooks2Β Thanks! Built for it β a rolling summarizer keeps context bounded over long sessions, anti-repetition stops it looping, and auto-breaks + rate-limit backoff keep it paced for 24/7. The real ceiling is usually your LLM provider's rate limits, not the app.
Would be interesting to see Wallie learn from gameplay over time, like remembering places, goals, or building styles instead of starting fresh every time.
Wallie V2
@freya24Β That's the dream There's already a cross-session memory layer (it remembers topics, regulars, running threads). But structured gameplay memory β "my base is here, my goal is diamonds, my build style" β is exactly the next step. Persistent goals + spatial memory = feels like a real player. Roadmap'd