getnormality · 2026-06-11T12:42:14 1781181734

This weird trend reached an apex in a Feb 2026 OpenAI blog post [1], recently on the front page [2], which describes the process for building... something... written 100% by agents.

There is no description of what the thing is, no indication of what value it provides its users. The closest it gets is "the product has been used by hundreds of users internally, including daily internal power users".

But the fact that the thing has a million lines of code is repeated twice in the first few hundred words.

[1] https://openai.com/index/harness-engineering/

[2] https://news.ycombinator.com/item?id=48416264

DrewADesign · 2026-06-11T13:59:40 1781186380

> "the product has been used by hundreds of users internally, including daily internal power users".

My guess is it’s an email filter.

> million lines of code

> written 100% by agents

Yeah, probably an email filter. Or maybe a JS menu for a departmental wiki that basically recreates jquery using MS JScript and transpiles it into JS 5.

pyrale · 2026-06-11T15:20:45 1781191245

> My guess is it’s an email filter.

It may also be an email generator.

The email filter team is trying to match the pace of innovation of the email generation team. At stake is the ability of employees to process the billions of mission-critical generated emails each of them receives each day.

DrewADesign · 2026-06-11T16:05:25 1781193925

It’s true. They’re all go-getters destined for big things. Look at those token burn rates!

getnormality · 2026-06-11T15:03:15 1781190195

Your hilariously specific hypotheses remind me of how little I know about technology.

DrewADesign · 2026-06-11T16:09:56 1781194196

> how little I know about technology.

Probably because you smoked too much weed in school.

Remember, this is the tech industry! An abject lack of knowledge is no impediment for people with boundless confidence in their assumptions!

QuercusMax · 2026-06-11T16:35:38 1781195738

Ya gotta start smoking weed after you're done with school, that's the way to success

selicos · 2026-06-11T20:28:55 1781209735

The best strategy is to figure out what you are going to do after smoking weed before you smoke weed. So, draft your prompt and send it before lighting up.

wiml · 2026-06-13T17:50:28 1781373028

Tokenmax while you tokemax.

packetlost · 2026-06-11T19:28:43 1781206123

And trash your short-term memory? Nah, smoking weed is straight up bad for most people.

DrewADesign · 2026-06-11T20:36:57 1781210217

Plot twist: they weren’t even enrolled in that school.

JCTheDenthog · 2026-06-11T13:59:22 1781186362

The entire Linux kernel is about 40 million LoC, and only something like 16 million LoC after you remove drivers. I have a hard time imagining whatever OpenAI was talking about there having anywhere close to 6% as much utility as the Linux kernel, despite having 6% as many lines of code. And I have a hard time imagining it's anywhere close to maintainable, regardless of how powerful their LLMs might be.
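For reference, the 6% figure matches the round numbers quoted in this thread only when the million lines are compared against the roughly 16 million non-driver kernel lines; against the full 40 million the share is smaller. A quick check, assuming nothing beyond those approximate figures (the constant names are just for illustration):

```python
# Round figures quoted in the thread, not measured counts.
AGENT_PROJECT_LOC = 1_000_000        # "a million lines of code" per the blog post
KERNEL_TOTAL_LOC = 40_000_000        # whole Linux kernel, approximate
KERNEL_NO_DRIVERS_LOC = 16_000_000   # kernel minus drivers, approximate


def share_percent(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


print(f"vs. kernel without drivers: "
      f"{share_percent(AGENT_PROJECT_LOC, KERNEL_NO_DRIVERS_LOC):.2f}%")  # 6.25%
print(f"vs. whole kernel: "
      f"{share_percent(AGENT_PROJECT_LOC, KERNEL_TOTAL_LOC):.2f}%")       # 2.50%
```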

esperent · 2026-06-11T14:35:53 1781188553

To be fair, few things of any number of LOC have as much utility as the Linux kernel, and it's also a particularly dense example of code. There's plenty of other examples that have higher LOC / utility ratio without being vibe coded. For example, Google's monorepo famously has 2 billion LOC, which is a statistic I've heard long before LLM coding took over.

jeffbee · 2026-06-11T14:51:13 1781189473

Clarification: Google claimed to have 2 billion lines of code in their repo ten years ago, and a commit rate of 50,000 changelists per day, both on exponential growth trends.

zaphar · 2026-06-11T15:27:51 1781191671

That's a monorepo with hundreds if not thousands of different applications. It's not even close to an apples to apples comparison.

jeffbee · 2026-06-11T16:36:44 1781195804

That's certainly a way to look at it. And that repo contains a "third party" directory which itself contains Linux, LLVM, and much of the rest of the open source world. But I would suggest that the largest of those thousands of applications probably has a transitive closure of hundreds of millions of lines of code.

zaphar · 2026-06-11T17:01:18 1781197278

I'm aware. I worked on that specific project (assuming we are talking about the same one) back in the day. :-)

There are certainly very large applications in that repo in the hundreds of millions of lines of code. But comparing the entire repo to single applications is not an apt comparison.

esperent · 2026-06-11T17:27:33 1781198853

Ok, then we can still compare one of those very large applications that have hundreds of millions of lines.

strulovich · 2026-06-11T15:36:45 1781192205

The Linux kernel is not in any way near the top of the list of big projects. A kernel, as the name suggests, deals with specific issues and tries to remain small.

The world’s biggest software is usually built over endless adapters of different data and a need to reconcile endless edge cases with laws, regulations and real world complexities.

dist-epoch · 2026-06-11T14:56:10 1781189770

Chrome has 50 mil LoC

https://openhub.net/p/chrome/analyses/latest/languages_summa...

skydhash · 2026-06-11T15:04:59 1781190299

Chrome is basically reinventing each OS API and libraries. One day they’ll have their own tcp stack and packet filter.

getnormality · 2026-06-11T15:07:51 1781190471

It kinda makes sense given that one of their major products is a computer that runs an operating system literally called ChromeOS.

jeffbee · 2026-06-11T15:16:12 1781190972

Arguably with QUIC it already does.

robotresearcher · 2026-06-11T15:46:04 1781192764

Chrome still has a way to go until Zawinski's Law is satisfied natively.

http://www.catb.org/jargon/html/Z/Zawinskis-Law.html

m463 · 2026-06-12T00:16:25 1781223385

When I was young, I remember a (joke) program from a book or magazine, called something like Report Writer. It was written in BASIC and you typed it in.

You would run it and it would say:

how many pages? _

You would type in a number, and it would generate that many pages of a complex-sounding report.

something like "the subsystem design interface is ..." blah blah

similar?

prodigycorp · 2026-06-11T16:32:15 1781195535

The reality distortion field is strong around Anthropic. Anthropic posts tons of equally bullshit blog posts, written entirely by AI, saying absolutely nothing, to the front page and they consistently average many hundreds of upvotes.

malfist · 2026-06-11T16:33:49 1781195629

I have to wonder if HN is astroturfed. Anthropic and OpenAI can post about a new model and it gets over to HN instantly and is full of glowing reviews of supposedly real people who supposedly have had prior access to it who supposedly think it's the best model yet.

prodigycorp · 2026-06-11T16:35:56 1781195756

Feels doubly so for Anthropic. There's just an unbelievable level of glaze, and insane upvote velocity for "blog posts" that are essentially fluffed up feature documentation.

Somehow everything Boris says has become the word of God. The dude is just an engineer, like you and me, who gets unlimited tokens for free.

malfist · 2026-06-11T16:54:53 1781196893

Yeah, I sometimes catch these posts minutes after they were submitted and they're full of highly upvoted glowing reviews that maintain their momentum on the top of the page.

And this is hacker news. A place where famously the most upvoted comment is the one critical of the post.

JackFr · 2026-06-11T17:06:18 1781197578

Upvoter here. Nothing special -- I like Claude a lot. I found it more ergonomic to work with, and I think its models are state of the art. I believe I'm more productive especially in legacy crufty codebases.

I may not be representative of the universe or have a controlled, randomized study to back it up, but that's not what upvotes are for are they?

prodigycorp · 2026-06-11T17:11:43 1781197903

That's a fair point. To play devil's advocate to myself, it's possible that the huge upvote share is due to a much bigger market share among our community.

mrandish · 2026-06-12T07:33:01 1781249581

Yeah, that was my guess too. Still a little disappointing to see fellow HNers reflexively fanboying a company that's the overwhelmingly dominant player in LLM coding.

I try to reserve my reflexive fanboy company/project votes for underdogs who need and deserve the help.

lispisok · 2026-06-11T17:11:12 1781197872

HN is astroturfed. Not just for AI, and since before AI took off. It's so easy and so valuable to astroturf HN.

joe_the_user · 2026-06-11T19:17:59 1781205479

The tech field has always been full of naive technology boosters. HN might be heavily astroturfed for all I know but I am sure there are many real people in a state of constant excitement over the progress of AI.

coffeefirst · 2026-06-11T21:54:14 1781214854

Heavily. There’s an amount of true believers who actually think RSI is going to happen any day now, but you can spot patterns amongst brand new booster accounts…

In real life I meet people who like AI and people who hate it, but nobody who’s on a personal mission to defend Anthropic from anyone that dares to question their hyperbolic marketing.

selcuka · 2026-06-12T00:08:02 1781222882

HN is probably astroturfed, but it is also one of those places where a lot of actual subject experts frequent. I don't find it unusual that a new model gets instantly posted here. It is also not unexpected that the most recently released model is the best model.

monkpit · 2026-06-11T20:49:19 1781210959

Sorry, was it not this?

https://github.com/openai/symphony

nhinck3 · 2026-06-12T10:13:05 1781259185

That does not appear to have a million lines of code

benj111 · 2026-06-11T18:27:14 1781202434

Oh god why did you make me read that.

> We intentionally chose this constraint so we would build what was necessary to increase engineering velocity by orders of magnitude

What kind of wanky bs is "engineering velocity". Maybe the post was written by AI?

habinero · 2026-06-11T19:12:45 1781205165

It's a real term. It's an Agile metric for measuring how much a team is shipping over time.

Whether or not the whole concept is wanky bs depends on who you ask lol. It's useful if you measure it over time, not so much otherwise.

pramodbiligiri · 2026-06-12T05:29:04 1781242144

Yeah, it was very disappointing that so few details were provided. That's one of the reasons I think it's going to be an open source project or effort that shows, sooner or later, how effective these things actually are.

In their podcast interview, they mention that it's an Electron app that users download, and so they periodically create a new build. See section "Autonomous Merging Flow" here: https://www.latent.space/p/harness-eng

getnormality · 2026-06-07T23:36:34 1780875394

Could we hop on a quick call to get a quick status on that quick fix?

cube00 · 2026-06-08T08:01:44 1780905704

Just wanted to check in on how we're tracking, I've booked a quick sync meeting for 4:45pm on Friday followed by a release cadence checkpoint meeting for 9am on Monday.

getnormality · 2026-06-07T16:31:31 1780849891

So, did this internal prototype ultimately end up being used to create/influence a real product, e.g. Codex app?

zbrock · 2026-06-07T23:33:36 1780875216

getnormality · 2026-06-11T03:05:39 1781147139

No detail about this in the article or your comment here, but the voluminous lines of code get a big call-out. Very interesting!

getnormality · 2026-06-07T16:08:14 1780848494

We've known for decades that output metrics like LOC/day are very bad measures of real productivity in software. But they seem to be back in vogue in the age of AI, because AI is so good at maxing these useless metrics, and we need to show how impressive our AI is and how impressive our usage of AI is.

getnormality · 2026-05-29T13:56:09 1780062969

I read the counterclaims from Bricks and Minifigs here:

https://bricksandminifigs.com/blog/blog/2026/05/28/bricks-mi...

This post and TFA have a common issue: no one seems to have a clear, compellingly evidenced account of basic questions about the collection and its history under consignment:

1. What exactly was in the collection?

2. What happened to the collection after it was consigned: which sets were sold, which were stolen or lost, which were moved to off-site storage, etc.?

3. How much money did the original franchise owner owe the consigner for the sets sold?

The peripheral claims about e.g. police malfeasance are disturbing, but without this basic evidence about the substance of the matter, I don't know if it's a great idea for an online mob to take sides.

maweaver · 2026-05-29T15:35:28 1780068928

One thing really stands out, which is that Bricks and Minifigs is pointing the finger at the franchisee for an "unauthorized" consignment deal, while the blog post has a screenshot of the contract specifically allowing consignments. That undercuts a large part of the corporate argument.

paulddraper · 2026-05-29T17:15:38 1780074938

Yeah, that's a glaring issue.

They say it was prohibited in the 2023 operating manual [1]. That's the same year as the franchise contract.

It'd be strange for B&M to lie about such an objectively falsifiable statement.

My best guess is that the franchise and operating manual were simply not in agreement.

(B&M wants to reject the consignment because they want the franchise operator to take liability for any missing or unreimbursed sales.)

[1] https://bricksandminifigs.com/blog/blog/2026/05/28/bricks-mi...

zargon · 2026-05-29T19:28:40 1780082920

There was a livestream last night where the Bricks and Minifigs COO said that the franchise agreement has a clause saying that the operations manual overrules the franchise agreement and they can update the operations manual at any time. BAM's court filing instead says that the franchise agreement allows consignment services, but "limited expressly to those approved by BAM".

peebee67 · 2026-05-31T05:11:39 1780204299

There's good evidence that it was expressly approved in the form of social media posts advertising the consignment on their social media pages. Difficult for them to argue either ignorance or that the arrangement wasn't authorised, both of which aren't really relevant to the demonstrated facts of the dispute.

getnormality · 2026-06-01T10:10:31 1780308631

The social media posts are from accounts for the franchise store. There is no reason to think those posts were approved by B&M corporate.

abirch · 2026-05-29T14:35:50 1780065350

You're right that we don't have the facts. That said, the way that Bricks and Minifigs handled this has been horrible. The CEO should have said on camera that he's willing to work with the lego owner to return the property and to work with police to charge the former franchise owner with theft.

If you see this clip, this is when corporate was removing the previous franchisee https://youtu.be/wscQpkcwgNU?si=_k_EDfs4NmO5riB5&t=126

Ovah · 2026-05-29T14:14:38 1780064078

Have you seen the youtube videos? They paint a pretty clear picture.

getnormality · 2026-05-29T23:28:58 1780097338

Clear enough that my questions can be answered in writing? Someone should do that.

Ovah · 2026-05-30T11:52:51 1780141971

You're saying that people shouldn't take a stance "without this basic evidence about the substance of the matter". The basic evidence is there. Take the time to watch Ben's videos, it's an hour of your time well spent.

solenoid0937 · 2026-05-29T14:01:25 1780063285

[flagged]

Ovah · 2026-05-29T14:25:48 1780064748

You are more or less accusing a named individual of severe crimes without much to back it up.

solenoid0937 · 2026-05-29T16:44:44 1780073084

Presumably the company's legal team felt the claims were so strong that they felt comfortable airing them publicly.

getnormality · 2026-05-22T00:16:17 1779408977

What I keep wondering is, what would have happened without the AI? Would they have just ignored your request?

In my neck of the woods it's fairly common that when a person doesn't know how to help you they just don't reply, instead of saying "I don't know how to help, sorry". AI-generated responses seem like the evolution of this attitude that one must either ignore or respond in a (superficially) helpful manner.

getnormality · 2026-05-21T00:19:33 1779322773

Yes, this and every internet forum will still be doing this two years hence. Your life will be better if you take to heart this famous passage from Nietzsche:

I do not want to wage war against what is ugly. I do not want to accuse; I do not even want to accuse those who accuse. Looking away shall be my only negation.

libraryofbabel · 2026-05-21T01:35:13 1779327313

> Looking away shall be my only negation.

I’ve been thinking of building myself my own frontend to HN that makes it impossible to view comments, for this reason. Yet sometimes there are still really interesting discussions and it’s hard to let go of what for me feels like the last social media I want to be part of.

jorisw · 2026-05-21T06:50:57 1779346257

An added stylesheet should be enough to do that

getnormality · 2026-05-19T00:17:09 1779149829

I think they're saying it has qualitatively different capabilities that make certain kinds of security work more worth pursuing with the model, not that the model of human-AI interaction has changed.

You're right that they're using a harness like everyone else. The general idea of giving the model a harness is not going to change. I mean even humans need harnesses to accomplish some things.

mycall · 2026-05-19T03:12:44 1779160364

Google Maps is my favorite human harness.

getnormality · 2026-05-18T05:55:48 1779083748

Wait, so, there's silent reading time that you hate, AND somehow seniors still manage to claim that they didn't read the thing?

kj4211cash · 2026-05-19T00:20:57 1779150057

Well, they don't say they didn't read the thing. But 10-20 minutes often isn't enough time to truly digest the arguments. And even if it were, I'm not sure it would matter. What I've seen is 10-20 minutes of awkward silent time followed by a level of discussion that isn't notably better than meetings before we adopted Amazon ways. Probably we are doing it wrong and/or have the wrong people in the room.

otterley · 2026-05-19T13:17:14 1779196634

10-20 minutes seems short. Most doc reads I’ve been in at Amazon have a 30-40 minute reading phase. 10-20 minute reads would only be for very short or simple documents.

getnormality · 2026-05-18T01:39:05 1779068345

> Now most of the friction comes from alignment and coordination with other teams.

Then I see a solution! Why don't we simply put the entire company on one big team?

necovek · 2026-05-18T05:03:23 1779080603

You joke but that's pretty much what cross functional teams are.

The only other observation is that as you grow teams, communication channels multiply quadratically, and at over 6-8 people communication starts breaking down.

So instead you make small "companies" and set a few ground rules which software they build needs to follow, and you are back at a working org producing complex software.
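To put rough numbers on the team-size point: if every pair of people on a team counts as one potential communication channel, a team of n people has n(n-1)/2 channels, which grows quadratically with headcount. A minimal sketch under that simplifying assumption (the function name is made up for illustration, and real teams rarely use every possible channel):

```python
def pairwise_channels(team_size: int) -> int:
    """Count distinct person-to-person links in a team of `team_size` people."""
    if team_size < 0:
        raise ValueError("team_size must be non-negative")
    return team_size * (team_size - 1) // 2


for n in (3, 6, 8, 20, 100):
    print(f"{n:>3} people -> {pairwise_channels(n):>4} channels")
```

Under this model a team of 6 has 15 channels and a team of 8 has 28, while a single 100-person team would have 4950, which is the arithmetic behind splitting into small teams with a few shared ground rules.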

jimbokun · 2026-05-18T04:06:42 1779077202

Putting everyone responsible for some function of a product on one team, instead of having separate departments for separate functions, can do wonders for actually shipping and iterating on software.

hank9 · 2026-05-18T17:00:01 1779123601

I don't remember who said it first but "software teams always ship their org chart" has always stuck with me. How a team is organized has such an outsized hand in what they build.