Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's what benchmarks like ARC-AGI are designed to test. The models are getting better at it, and you aren't.

Nothing ultimately matters in this business except the first couple of time derivatives.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: