
Generation time is more or less proportional to tokens * model size, so if you can get the same quality result with fewer tokens from the same size of model, then you save time and money.
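To put rough numbers on that, here is a minimal sketch. The relative_cost helper, the 7B model size, and the token counts are all made up for illustration, and it assumes cost scales with the simple product of generated tokens and parameter count, which is only an approximation.

```python
def relative_cost(tokens: int, params_billion: float) -> float:
    """Rough cost proxy in arbitrary units: generated tokens times model size."""
    return tokens * params_billion


# Hypothetical case: the same 7B-parameter model reaches the same quality
# with a verbose output and with a more concise one.
verbose = relative_cost(tokens=1000, params_billion=7.0)
concise = relative_cost(tokens=600, params_billion=7.0)

# With model size fixed, the saving depends only on the token ratio.
savings = 1 - concise / verbose
print(f"Estimated saving in time and money: {savings:.0%}")
```

Because the model size is the same on both sides, it cancels out, so the estimated saving is just the fraction of tokens you avoided generating.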




Thanks. That was not obvious to me either.


