Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

OP's point is that the hardware specs are 2-3x higher in many places, but all their benchmarks are 20-30% higher. The article mentions this as well. It means AMD couldn't even utilize their own hardware very well at this point.


Transformers are heavily memory bandwidth bound on modern hardware, and these chips only have 60% higher memory bandwidth.


Their own slides show a 20-30% speedup on attention tasks.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: