Google Gemini Live is pretty good. If you want to try only voice, Try unmute.sh ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		amrrs 9 months ago \| parent \| context \| favorite \| on: Bagel: Open-source unified multimodal model Google Gemini Live is pretty good. If you want to try only voice, Try unmute.sh by Kyutai which will be eventually open-sourced

spuz 9 months ago [–]

Thanks - it seems that Gemini Live is pretty far behind advanced voice mode at the moment. For example, I can't get it to speak slower when I want to understand what it is saying.

I'm still interested in what keyword I could use to search for the latest research in voice models.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact