Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You should throw in some diarization, there's some pretty effective libraries that don't need pertraining on the voice separation in python.


I would suggest 2 speaker-diarization libraries:

- https://huggingface.co/pyannote/speaker-diarization-3.1 - https://github.com/narcotic-sh/senko

I personally love senko since it can run in seconds, whereas py-annote took hours, but there is a 10% WER (word error rate) that is tough to get around.


Nice suggestion, I'll look them up.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: