Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The numbers are pretty incredible. Will the competition be able to match them?


Groq is claiming 284 tokens/second on Llama 3.1 70b, so they’re in the same ballpark.

https://groq.com/12-hours-later-groq-is-running-llama-3-inst...


If Groq 2 is 2x faster it will match Cerebras WSE-3.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: