Forums

•

1mo ago

Featured

Talk to Claude Code (with your voice) from anywhere

Here's an MCP server that lets you talk to Claude Code from anywhere you can negotiate a WebRTC connection (or make a phone call):

https://github.com/pipecat-ai/pi...

p/claude

•

1mo ago

Featured

Talk to Claude Code (with your voice) from anywhere

Here's an MCP server that lets you talk to Claude Code from anywhere you can negotiate a WebRTC connection (or make a phone call):

https://github.com/pipecat-ai/pi...

p/daily

•

6mo ago

Featured

Voice and Realtime Hackathon at YC (Oct 11) w Gemini + Pipecat

Just saw @kwindla post this and I wish I could attend!

Space is limited - apply here: https://events.ycombinator.com/p...

p/daily

•

10mo ago

Featured

May Voice AI course: $10k in credits + a session with Product Hunt

We've got a great roster of participating companies for the month-long Voice AI course and community events series that starts this week.

p/daily

•

11mo ago

Featured

Voice AI course and online community

Announcing: Voice Agents course and online community ...

@swyx and I are hosting a month-long technical deep dive into Voice AI and Voice Agents, starting in May.
Our goals are to:
cover all the lessons we've learned over the last two years building realtime, conversational AI,
host fun sessions with all our favorite people who are doing related things,
and
build a long-term online community.
Sign-up link: https://lnkd.in/gnPuHyD4
We'll start announcing free credits for students on Monday. Sign up this weekend with promo code PHUNT for a super-secret Product Hunt community discount.
Last year I signed up for the LLM fine-tuning course taught by Hamel Husain and Dan Becker.
The experience was fantastic in every way. The material was great. The course expanded to cover way more than fine-tuning. It seemed like all of Twitter signed up. I met people in the course Discord that have become online and offline friends. Someone eventually dubbed the course "AI Woodstock." (I think credit for that goes to Swyx.)
We think this is the moment to try to create a similar thing for voice AI.
Voice interfaces are going to be a huge part of the near-future of computing. Voice agents are being deployed at scale today for a wide range of use cases.
collecting patient data prior to healthcare appointments
following up on inbound sales leads,
handling an increasing variety of call center tasks,
coordinating scheduling and logistics between companies, and
answering the phone for nearly every kind of small business.
I'm personally excited about voice interactions for games, realtime video, and voice-enabled programming environments.
https://lnkd.in/gnPuHyD4
Promo code: PHUNT

p/daily

•

11mo ago

Featured

Voice AI course and online community

Announcing: Voice Agents course and online community ...

p/daily

•

11mo ago

Request for product: voice-based dev environment

Here's my hacked-together, messy, voice-based dev environment:

Voice-driven loop with screen-shotting so the LLM in the loop can see what's in my terminal and editor. The prompt varies depending on what I'm trying to drive with this loop.
A few tool definitions that give read access to files and URLs.
A tool the LLM can send a block of output to that generates keyboard events, so the LLM can drive any editor/terminal.
A separate process watching a directory and constantly making LLM-driven git commits. (git autosave).

I have some pieces of this running most of the time. But I'm lazy, and doing other stuff, and I also try to use a variety of editors and tools, to see what's good lately. Which ... no stability, so my hacked-together stuff is always broken.

I don't want to replace @Windsurf / @Cursor / Claude code. A seriously good agent and expert-system dev toolkit is a lot of work.

p/daily

•

11mo ago

Request for product: voice-based dev environment

Here's my hacked-together, messy, voice-based dev environment:

Voice-driven loop with screen-shotting so the LLM in the loop can see what's in my terminal and editor. The prompt varies depending on what I'm trying to drive with this loop.
A few tool definitions that give read access to files and URLs.
A tool the LLM can send a block of output to that generates keyboard events, so the LLM can drive any editor/terminal.
A separate process watching a directory and constantly making LLM-driven git commits. (git autosave).

I don't want to replace @Windsurf / @Cursor / Claude code. A seriously good agent and expert-system dev toolkit is a lot of work.

p/elevenlabs

•

1yr ago

Featured

I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)

Hi Everyone!
Solving AI audio end-to-end means tackling both generation and understanding - from text-to-speech to speech-to-text and everything in between. At ElevenLabs, we re working on breakthroughs in AI audio that bridge research and real-world use.
Ask me anything about what we re building, the challenges of scaling AI speech models, and where this space is headed. Also keen to hear what you ve built with ElevenLabs!

p/elevenlabs

•

1yr ago

Featured

I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)

p/photoshop

•

1yr ago

Happy Birthday, Photoshop!

@rajiv_ayyangar and I have talked a lot about what applications, devices, and codebases have most influenced our thinking and careers. Photoshop is high on that list for me.
For many years it was the most reliable application I used regularly, bar none. In a world where we just expected the Windows blue screen and the little Mac unhappy icon to happen regularly, Photoshop never crashed. Even though both its feature set complexity and working data set size were much larger than most other apps of the time.
Photoshop was also truly cross platform. It worked the same way on Windows and Mac. This is hard!
It had a plugin system. Indie developers built great extensions for Photoshop.
And it had a credits screen that you saw every time the app loaded, with the names of all the programmers who worked on it!

p/openai

•

1yr ago

What are your thoughts on Openai's Realtime API?

I don't see a lot of products using the realtime api in building their conversation ai agents. Given that it now has realtime communication support through WebRTC allowing low latency conversations, I expected it to blow up. Are there any limitations of this model like hallucinations and or is it just too expensive for commercial use?

p/openai

•

1yr ago

What are your thoughts on Openai's Realtime API?

•

7yr ago

Daily Video API - Add video calls to your app, in minutes.

Now you can add secure, 1-click video calls that work anywhere to your product with just an <iframe> tag. Plus, our API makes building flexible communication workflows easy, with user permissions, chat, recording, and more.

•

6yr ago

Daily.co API for HIPAA Compliant Video - Add secure video chat to healthcare apps

Daily.co is an API to add secure, reliable video calls to telehealth applications. With 2 lines of code, developers can embed a HIPAA compliant video chat widget. Calls work on mobile and across browsers and devices. We'll sign a Business Associate Agreement.

•

1yr ago

Daily Bots - Build voice-to-voice AI agents with any LLM with Open Source

Build hosted AI agents on Daily Bots, the Open Source cloud for ultra low latency voice and video. Use commercial and open models. Incorporate tool use, telephony, vision, etc. If your needs evolve beyond Daily Bots, run your code anywhere that supports RTVI.

Kwindla Kramer

Forums

Talk to Claude Code (with your voice) from anywhere

Talk to Claude Code (with your voice) from anywhere

Voice and Realtime Hackathon at YC (Oct 11) w Gemini + Pipecat

May Voice AI course: $10k in credits + a session with Product Hunt

Voice AI course and online community

Voice AI course and online community

Request for product: voice-based dev environment

Request for product: voice-based dev environment

I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)

I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)

Happy Birthday, Photoshop!

What are your thoughts on Openai's Realtime API?

What are your thoughts on Openai's Realtime API?

Daily Video API - Add video calls to your app, in minutes.

Daily.co API for HIPAA Compliant Video - Add secure video chat to healthcare apps

Daily Bots - Build voice-to-voice AI agents with any LLM with Open Source