Sokuji

Sokuji

Real-time AI translation for multilingual conversations

59 followers

Sokuji breaks language barriers using OpenAI's Realtime API. It translates speech instantly through GPT-4o and routes audio to video calls. Available as both a desktop app with virtual audio devices and a browser extension for Google Meet/Microsoft Teams/Zoom.
Sokuji gallery image
Sokuji gallery image
Sokuji gallery image
Sokuji gallery image
Sokuji gallery image
Sokuji gallery image
Sokuji gallery image
Free
Launch Team / Built With
AssemblyAI
AssemblyAI
Build voice AI apps with a single API
Promoted

What do you think? …

Jiang zhuo
We built Sokuji to solve a real problem: enabling seamless communication across language barriers in real-time conversations. What makes Sokuji unique is its complete audio routing solution with virtual device management that integrates directly with applications like Google Meet. Unlike other translation tools that just provide text, Sokuji delivers spoken translations through your microphone in real-time, creating a truly natural conversation flow. We're most proud of how Sokuji makes advanced AI technology accessible and practical. The application creates virtual audio devices, handles automatic routing, and provides intuitive visualizations that make the complex process of simultaneous interpretation feel effortless. Our browser extension brings the same powerful features directly to your browser without installation requirements, making real-time translation available to everyone. Whether you're in international business meetings, connecting with family abroad, or learning a new language, Sokuji removes barriers to understanding. We'd love to hear how you use Sokuji and what languages you're connecting with!

Tried it while practicing Spanish with a friend and it was actually fun 😄 One thing I noticed: it worked better with a headset mic than my laptop mic. Any audio hardware recommendations or best practices for clarity?

Jiang zhuo

@hamza_afzal_butt 
The main issue lies in the clarity of the input device and background noise.

Laptop microphones typically have a wide pickup range—often a fan-shaped area in front of the laptop—so they tend to capture more background noise.

This becomes particularly problematic when the monitoring feature is enabled and output through speakers, as the microphone may pick up the translated audio, leading to reduced accuracy.

However, using headphones for monitoring and a headset microphone can avoid this issue.


I don’t have a specific microphone recommendation, but here are a few things you can try:

  • Adjust Noise Reduction based on your environment.

  • In the Audio settings, you can disable the monitoring feature or avoid using speaker output for monitoring.

@jiang_zhuo Thank you!

Erliza. P

Real-time multilingual communication unlocked! 🌍🗣️ Sokuji's GPT-4o powered translation with seamless audio routing is game-changing for global teams. Finally, language barriers dissolve during video calls without awkward pauses ⚡

Joy Wang

Sokuji delivers real-time speech translation powered by GPT-4o, making multilingual communication seamless during video calls. With both a desktop app and a browser extension for Google Meet, it’s a practical and powerful tool for global collaboration.

sathithya yogi

this is insane

Supa Liu

Sokuji instantly breaks language barriers by translating speech in real time with GPT-4o, seamlessly integrating into video calls via desktop app or Google Meet extension.

Evgenii Zaitsev

Sokuji tackles a huge challenge in real-time communication - breaking down language barriers naturally and seamlessly. The approach of routing spoken translations through virtual audio devices is really clever and feels way more intuitive than just text translation.

I’m curious, how many languages does Sokuji support currently, and how does it handle accents or dialects? Also, any plans to expand beyond Google Meet integrations?

Jiang zhuo

@evgenii_zaitsev1 
Its capability depends on the AI model being used. Currently, the models provided by OpenAI support most major languages around the world, and the results are quite good.

As for accents and dialects, that also mainly depends on the AI model. We're working on integrating different models and API providers, and we believe this will improve as AI technology advances. At present, it already performs well with various English accents from around the world.

We're currently developing integration for Microsoft Teams. Zoom or a Windows client may follow.

At the moment, we only have a Linux desktop client, which functions at the system level and is not limited by specific meeting platforms—so it can be used in any scenario where a microphone is involved.

12
Next
Last