Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What's going on with this plot's y-axis?

https://bsky.app/profile/tylermw.com/post/3lvtac5hues2n



It makes it look like the presentation is rushed or made last minute. Really bad to see this as the first plot in the whole presentation. Also, I would have loved to see comparisons with Opus 4.1.

Edit: Opus 4.1 scores 74.5% (https://www.anthropic.com/news/claude-opus-4-1). This makes it sound like Anthropic released the upgrade to still be the leader on this important benchmark.


> like the presentation is rushed or made last minute

Or written by GPT-5?


They never compare with other vendors


Also this coding deception rate bar tries to decieve us.

https://imgur.com/a/QkriFco


It’s beyond parody that they did something like this on a slide about deception. You couldn’t make this stuff up.


After reading around, it seems like they probably forgot to update/swap the slides before presentation. The graphs were correct on their website, as they launched. But the ones they used in the presentation were probably some older versions they had forgotten to fix.


This is hilarious


Probably created without thinking enabled. Lower % accuracy ensues, speaking from experience.


Probably generated by AI.


If not, the person that made the chart just got $1.5M


Couldn’t believe it was real haha


idiots everywhere. I BET person who made this earns a good salary


Please don't post like this to Hacker News, regardless of how idiotic other people are or you feel they are.

You may not owe people who you feel are idiots better, but you owe this community better if you're participating in it.

https://news.ycombinator.com/newsguidelines.html




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: