Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Are there any remotely comparable open source models? Fully multimodal, audio-to-audio?


Hmm, there’s this Gazelle that can take in audio, but to get audio back out you’d have to use something else (e.g. Piper).

https://github.com/tincans-ai/gazelle?tab=readme-ov-file

https://tincans.ai/slm

https://github.com/rhasspy/piper




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: