Forums
Voice and Realtime Hackathon at YC (Oct 11) w Gemini + Pipecat
Just saw @kwindla post this and I wish I could attend!
Space is limited - apply here: https://events.ycombinator.com/p...
May Voice AI course: $10k in credits + a session with Product Hunt
We've got a great roster of participating companies for the month-long Voice AI course and community events series that starts this week.

Voice AI course and online community
Announcing: Voice Agents course and online community ...

@swyx and I are hosting a month-long technical deep dive into Voice AI and Voice Agents, starting in May.
Our goals are to:
cover all the lessons we've learned over the last two years building realtime, conversational AI,
host fun sessions with all our favorite people who are doing related things,
and
build a long-term online community.
Sign-up link: https://lnkd.in/gnPuHyD4
We'll start announcing free credits for students on Monday. Sign up this weekend with promo code PHUNT for a super-secret Product Hunt community discount.
Last year I signed up for the LLM fine-tuning course taught by Hamel Husain and Dan Becker.
The experience was fantastic in every way. The material was great. The course expanded to cover way more than fine-tuning. It seemed like all of Twitter signed up. I met people in the course Discord that have become online and offline friends. Someone eventually dubbed the course "AI Woodstock." (I think credit for that goes to Swyx.)
We think this is the moment to try to create a similar thing for voice AI.
Voice interfaces are going to be a huge part of the near-future of computing. Voice agents are being deployed at scale today for a wide range of use cases.
collecting patient data prior to healthcare appointments
following up on inbound sales leads,
handling an increasing variety of call center tasks,
coordinating scheduling and logistics between companies, and
answering the phone for nearly every kind of small business.
I'm personally excited about voice interactions for games, realtime video, and voice-enabled programming environments.
https://lnkd.in/gnPuHyD4
Promo code: PHUNT
Voice AI course and online community
Announcing: Voice Agents course and online community ...

@swyx and I are hosting a month-long technical deep dive into Voice AI and Voice Agents, starting in May.
Our goals are to:
cover all the lessons we've learned over the last two years building realtime, conversational AI,
host fun sessions with all our favorite people who are doing related things,
and
build a long-term online community.
Sign-up link: https://lnkd.in/gnPuHyD4
We'll start announcing free credits for students on Monday. Sign up this weekend with promo code PHUNT for a super-secret Product Hunt community discount.
Last year I signed up for the LLM fine-tuning course taught by Hamel Husain and Dan Becker.
The experience was fantastic in every way. The material was great. The course expanded to cover way more than fine-tuning. It seemed like all of Twitter signed up. I met people in the course Discord that have become online and offline friends. Someone eventually dubbed the course "AI Woodstock." (I think credit for that goes to Swyx.)
We think this is the moment to try to create a similar thing for voice AI.
Voice interfaces are going to be a huge part of the near-future of computing. Voice agents are being deployed at scale today for a wide range of use cases.
collecting patient data prior to healthcare appointments
following up on inbound sales leads,
handling an increasing variety of call center tasks,
coordinating scheduling and logistics between companies, and
answering the phone for nearly every kind of small business.
I'm personally excited about voice interactions for games, realtime video, and voice-enabled programming environments.
https://lnkd.in/gnPuHyD4
Promo code: PHUNT
Request for product: voice-based dev environment
Here's my hacked-together, messy, voice-based dev environment:
Voice-driven loop with screen-shotting so the LLM in the loop can see what's in my terminal and editor. The prompt varies depending on what I'm trying to drive with this loop.
A few tool definitions that give read access to files and URLs.
A tool the LLM can send a block of output to that generates keyboard events, so the LLM can drive any editor/terminal.
A separate process watching a directory and constantly making LLM-driven git commits. (git autosave).
I have some pieces of this running most of the time. But I'm lazy, and doing other stuff, and I also try to use a variety of editors and tools, to see what's good lately. Which ... no stability, so my hacked-together stuff is always broken.
I don't want to replace @Windsurf / @Cursor / Claude code. A seriously good agent and expert-system dev toolkit is a lot of work.
Request for product: voice-based dev environment
Here's my hacked-together, messy, voice-based dev environment:
Voice-driven loop with screen-shotting so the LLM in the loop can see what's in my terminal and editor. The prompt varies depending on what I'm trying to drive with this loop.
A few tool definitions that give read access to files and URLs.
A tool the LLM can send a block of output to that generates keyboard events, so the LLM can drive any editor/terminal.
A separate process watching a directory and constantly making LLM-driven git commits. (git autosave).
I have some pieces of this running most of the time. But I'm lazy, and doing other stuff, and I also try to use a variety of editors and tools, to see what's good lately. Which ... no stability, so my hacked-together stuff is always broken.
I don't want to replace @Windsurf / @Cursor / Claude code. A seriously good agent and expert-system dev toolkit is a lot of work.
I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)
Hi Everyone!
Solving AI audio end-to-end means tackling both generation and understanding - from text-to-speech to speech-to-text and everything in between. At ElevenLabs, we re working on breakthroughs in AI audio that bridge research and real-world use.
Ask me anything about what we re building, the challenges of scaling AI speech models, and where this space is headed. Also keen to hear what you ve built with ElevenLabs!
I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)
Hi Everyone!
Solving AI audio end-to-end means tackling both generation and understanding - from text-to-speech to speech-to-text and everything in between. At ElevenLabs, we re working on breakthroughs in AI audio that bridge research and real-world use.
Ask me anything about what we re building, the challenges of scaling AI speech models, and where this space is headed. Also keen to hear what you ve built with ElevenLabs!
Happy Birthday, Photoshop!
@rajiv_ayyangar and I have talked a lot about what applications, devices, and codebases have most influenced our thinking and careers. Photoshop is high on that list for me.
For many years it was the most reliable application I used regularly, bar none. In a world where we just expected the Windows blue screen and the little Mac unhappy icon to happen regularly, Photoshop never crashed. Even though both its feature set complexity and working data set size were much larger than most other apps of the time.
Photoshop was also truly cross platform. It worked the same way on Windows and Mac. This is hard!
It had a plugin system. Indie developers built great extensions for Photoshop.
And it had a credits screen that you saw every time the app loaded, with the names of all the programmers who worked on it!






