Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You can fine tune without the pre training data too.

Mistral models are one example, they never released pre training data and there are many fine tunes.



We are in agreement -- that's exactly what I am saying :)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: