Hacker News

> opus scores 97.1% when given an actual vision access

Do you have a source for this? I would be very curious to see how top models do with vision.

No, there is no source for this. Opus scores around 1%, just like all the other frontier models. It would be fairly trivial to add a renderer intermediary, and if that pushed the score to 97+%, you would get a huge cut of $2 million. The assertion that Opus gets 97% if you just give it a GUI is completely bogus.


