More

NickNaraghi · 2026-05-07T22:48:13 1778194093

Doesn’t sound like you’ve been there

FernetBronco · 2026-05-08T14:41:58 1778251318

I’ve been to regional events, and it’s absolutely true that $90 tickets awarded to last years volunteers moop less than $550 face

DonHopkins · 2026-05-08T00:41:27 1778200887

Jeff Bezos, Mark Zuckerberg, Sam Altman, Elon Musk, Elizabeth Holmes, Sergey Brin, Larry Page, and Eric Schmidt have been there. Ask them how much trash they left behind or cleaned up.

I bet Jeff Bezos didn't carry out all his urine in plastic bottles with his private jet.

NickNaraghi · 2026-05-07T00:37:04 1778114224

Yes! Anthropic team calls this “regenerate, don’t fix.”

The person who builds an agentic IDE or GitHub alternative that natively does the process you describe will be a multibillionare.

dataviz1000 · 2026-05-07T01:00:18 1778115618

> https://github.com/adam-s/agent-tuning

Do you want a demo of what this is capable of?

NickNaraghi · 2026-04-25T21:59:13 1777154353

It's a funny thing to write, like an article in an old newspaper that aged quickly. I suspect that this will be wildly out of date within 2-3 years.

kj4211cash · 2026-04-26T12:27:51 1777206471

There is this belief that in 2-3 years AI will be much better and all the gripes people have with AI use today will be solved. Honestly, personally, I think that optimism will age poorly. But to say it out loud at work or post publicly probably hurts my career prospects.

krackers · 2026-04-25T22:01:16 1777154476

I think it's already out of date with verifiable reward based RL, e.g. on maths domain. When "correctness" arguments fall, the argument will probably just shift to whether it's just "intelligent brute force".

gipp · 2026-04-26T00:10:31 1777162231

The set of tasks for which "correctness" is formally verifiable (in a way that doesn't put Goodharts Law in hyperdrive) is vanishingly small.

TheOtherHobbes · 2026-04-25T23:04:11 1777158251

"stochastic genius"

dcre · 2026-04-26T15:20:01 1777216801

It's already out of date because it makes no sense. If it's true that the superficial signals of quality were once somehow good enough to keep the entire economy on the rails (it's not true), surely you can have an LLM look at given piece of work and extract comparably useful signals of quality or effort.

Izkata · 2026-04-26T17:18:51 1777223931

> If it's true that the superficial signals of quality were once somehow good enough to keep the entire economy on the rails (it's not true)

It was true. The negative signals (we called them "code smells") weren't the be-all-end-all of reviews, they indicated to the reviewer where to spend more effort. It got us 90% of the benefit of an in-depth review with 10% of the effort. But with LLMs eliminating this, we now have to spend all our effort on everything, taking a lot more time and energy overall.

dcre · 2026-04-26T21:25:12 1777238712

I think it’s true that we were able to establish trust and produce good work without verifying every detail — what I’m suggesting is that signals of that kind were not a very important factor. And code smells still work!

NickNaraghi · 2026-04-22T20:05:17 1776888317

Note that these are 2023 numbers, not 2025.

pavon · 2026-04-22T20:52:30 1776891150

The NAEP has two types of tests - a long-term trend assessment, and the main assessment[1]. The long-term trend is given less often (and rather sporadically recently), and 2023 is the most recent one available.

The main assessment has been performed every two years recently, so 2024 data is most recent. They can all be seen here[2].

[1] https://en.wikipedia.org/wiki/National_Assessment_of_Educati...

[2] https://www.nationsreportcard.gov/report_archive.aspx

ddtaylor · 2026-04-22T20:12:46 1776888766

Are there newer numbers?

NickNaraghi · 2026-04-20T14:28:48 1776695328

As someone who has worked on multiple marketplace startups, I highly highly recommend this resource: https://www.nfx.com/post/network-effects-bible

NickNaraghi · 2026-04-20T13:32:25 1776691945

With AI coding tools, pretty easy to use Mangos or similar to run a private server locally. They even have versions that fill the world with fake players to make it feel more MMOish.

NickNaraghi · 2026-04-16T15:58:02 1776355082

232 pages is bullshit. Longer than the Mythos system card? What are you hiding.

NickNaraghi · 2026-04-16T15:54:32 1776354872

The generations are two months apart now though…

NickNaraghi · 2026-04-07T18:41:07 1775587267

See page 54 onward for new "rare, highly-capable reckless actions" including

- Leaking information as part of a requested sandbox escape

- Covering its tracks after rule violations

- Recklessly leaking internal technical material (!)

dalben · 2026-04-07T21:36:54 1775597814

> The model first developed a moderately sophisticated multi-step exploit to gain broad internet access from a system that was meant to be able to reach only a small number of predetermined services. [9] It then, as requested, notified the researcher. [10] In addition, in a concerning and unasked-for effort to demonstrate its success, it posted details about its exploit to multiple hard-to-find, but technically public-facing, websites.

> 10: The researcher found out about this success by receiving an unexpected email from the model while eating a sandwich in a park.

Phew. AGI will be televised.

skippyboxedhero · 2026-04-07T18:50:33 1775587833

Anyone who has used Opus recently can verify that their current model does all of these things quite competently.

ls612 · 2026-04-07T23:35:45 1775604945

I had Opus 4.6 start analyzing the binary structure of a parquet file because it was confused about the python environment it was developing in and couldn't use normal methods for whatever reason. It successfully decoded the schema and wrote working code afterwards lol.

SkyPuncher · 2026-04-07T20:51:46 1775595106

I was reading the Glasswing report and had the same thought. Most of the stuff they claim Mythos found has no mention of Opus being able to find it as well.

Don’t get me wrong, this model is better - but I’m not convinced it’s going to be this massive step function everyone is claiming.

unbrice · 2026-04-08T00:51:38 1775609498

From the press release:

> With one run on each of roughly 7000 entry points into these repositories, Sonnet 4.6 and Opus 4.6 reached tier 1 in between 150 and 175 cases, and tier 2 about 100 times, but each achieved only a single crash at tier 3. In contrast, Mythos Preview achieved 595 crashes at tiers 1 and 2, added a handful of crashes at tiers 3 and 4, and achieved full control flow hijack on ten separate, fully patched targets (tier 5).

taytus · 2026-04-07T19:11:53 1775589113

That has also been my experience. And if Mythos is even worse, unless you have a significantly awesome harness, sounds like pretty unusable if you don't want to risk those problems.

wolttam · 2026-04-07T20:45:12 1775594712

Human in the loop is the best way to go. You'll still be way faster than without the agent, and there is no risk of it going haywire unless you turn off your brain!

hamandcheese · 2026-04-08T07:36:49 1775633809

> unless you turn off your brain

skippyboxedhero · 2026-04-07T19:27:51 1775590071

I think are fundamental issues with the story that Anthropic is selling. AGI is very close, we will definitely get there, it is also very dangerous...so Anthropic should be the only ones trusted with AGI.

If you look at recent changes in Opus behaviour and this model that is, apparently, amazingly powerful but even more unsafe...seems suspect.

mikkupikku · 2026-04-07T20:15:28 1775592928

It seems broadly coherent to me. They think only they should be trusted with power, presumably because they trust themselves and don't trust other people. Of course the same is probably also true for everybody who isn't them. Nobody could be trusted with the immense responsibility of Emperor of Earth, except myself of course.

I'm not saying this is a good or reassuring stance, just that it's coherent. It tracks with what history and experience says to expect from power hungry people. Trusting themselves with the kind of power that they think nobody else should be trusted with.

Are they power hungry? Of course they are, openly so. They're in open competition with several other parties and are trying to win the biggest slice of the pie. That pie is not just money, it's power too. They want it, quite evidently since they've set out to get it, and all their competitors want it too, and they all want it at the exclusion of the others.

FeepingCreature · 2026-04-07T19:56:46 1775591806

This makes sense if Anthropic think they're the best-positioned to make safe AI. However if you are looking at an AI company there's obviously some selection happening.

0x3f · 2026-04-07T19:46:09 1775591169

> AGI is very close

Based on? Or are you just quoting Anthropic here?

skippyboxedhero · 2026-04-07T19:49:30 1775591370

My Anthropic rep told me it was just around the corner...you aren't saying he lied to me? Can't believe this, I thought he was my friend.

stavros · 2026-04-08T12:26:58 1775651218

"Let me see if the secrets are specified. echo $SECRETS"

BoredPositron · 2026-04-07T20:02:07 1775592127

To be honest it feels like we are reading stuff like this on every model release.

ageedizzle · 2026-04-09T02:11:29 1775700689

> Recklessly leaking internal technical material (!)

Are they alluding to how they accidentally leaked some of their code?

washedup · 2026-04-07T19:16:43 1775589403

"All of the severe incidents of this kind that we observed involved earlier versions of Claude Mythos Preview which, while still less prone to taking unwanted actions than Claude Opus 4.6, predated what turned out to be some of our most effective training interventions. These earlier versions were tested extensively internally and were shared with some external pilot users."

NickNaraghi · 2026-04-07T18:32:51 1775586771

> Over the past few weeks, we have used Claude Mythos Preview to identify thousands of zero-day vulnerabilities (that is, flaws that were previously unknown to the software’s developers), many of them critical, in every major operating system and every major web browser, along with a range of other important pieces of software.

Sounds like we've entered a whole new era, never mind the recent cryptographic security concerns.