More

weatherlite · 2025-12-30T08:14:11 1767082451

> Are you sure it’s _people_ driving this increase?

Most likely - yes. If Google has been dead for years people wouldn't pour hundreds of billions of dollars into ads there. The Search revenue keeps increasing, even since ChatGPT showed up. It might stagnate soon or even decrease a bit - but "death" ? The numbers don't back this up. One blog saying he stops paying for Google ads conflicts with the reality of around 200 billion yearly revenue from Search.

crazygringo · 2025-12-30T14:05:47 1767103547

Exactly this. Businesses decide whether to pay for ads based on clickthru rates and conversions. Bots don't click through. They don't convert. If these rates fall, advertisers will pay proportionally less as their max bid, and Search ads revenue will fall substantially.

That hasn't happened. Google continues to grow with real users.

weatherlite · 2025-12-29T11:51:29 1767009089

> None of the stated problems are actually issues with LLMs after on policy training is performed

But still , isnt it a major weakness they have to RL on everything that has not much data? That really weakens the attempt to make it true AGI.

Legend2440 · 2025-12-29T16:30:55 1767025855

No.

AGI would be a universal learner, not a magic genie. It still needs to do learning (RL or otherwise) in order to do new tasks.

weatherlite · 2025-12-29T18:15:42 1767032142

> It still needs to do learning (RL or otherwise) in order to do new tasks.

Why ? As in - why isn't reading the Brainfuck documentation enough for Gemini to learn Brainfuck ? I'd allow for 3-7 days of a learning curve like perhaps a human would need but why do you need to kinda redo the whole model (or big parts of it) just so it could learn Brainfuck or some other tool? Either the learning (RL or otherwise) need to become way more efficient than it is today (takes today weeks? months? billions of dollars) or it isn't AGI I would say. Not in practical/economic sense and I believe not in the philosophical sense of how we all envisioned true generality.

weatherlite · 2025-12-18T15:23:59 1766071439

> Almost anyone can prompt an LLM to generate a thousand-line patch and submit it for code review. That’s no longer valuable. What’s valuable is contributing code that is proven to work.

That's really not a great development for us. If our main point is now reduced to accountability over the result with barely any involvement in the implementation - that's very little moat and doesn't command a high salary. Either we provide real value or we don't ...and from that essay I think it's not totally clear what the value is - it seems like every QA, junior SWE or even product manager can now do the job of prompting and checking the output.

simonw · 2025-12-18T15:27:04 1766071624

The value is being better at it than any QA or product manager.

Experienced software engineers have such a huge edge over everyone else with this stuff.

If your product manager doesn't understand what a CORS header is good luck having them produce a change that requires cross-domain fetch() call... and first they'll have to know what a "cross-doman fetch() call" means.

And sure they could ask an LLM about that, but they still need the vocabulary and domain knowledge to get to that question.

falcor84 · 2025-12-18T15:37:09 1766072229

That's an interesting argument, but from my industry experience, the average experienced QA Engineer and technical Product Manager both have better vocabulary than the average SWE. Indeed, I wonder whether a future curriculum for Vibe Engineering (to borrow your own term) may look more similar to that of present-day QA or Product curricula, than to a typical coding or CS curriculum.

throw1235435 · 2025-12-19T21:58:19 1766181499

Nah; the only advantage that a software engineer has is that if they are experienced they've probably just a little bit bright. But their role will probably change to something other than a software engineer. Bright valuable people that care and are engaged are rare anyway. They may transition to a different role slowly (e.g. Product, QA, BA, etc) because they still offer value and know the domain, but it isn't traditional SWE. That's been disrupted by AI; I don't want it to be true and I'm hoping for something else; but reality is staring us in the face at the moment and it isn't fair to people to talk platitudes anymore. The fact that you have to write an article like this feels like defensive framing to me + illustrates what happens once a skill is devalued by people/society due to disruption; it proves to me where this is all heading.

My thought on why people especially juniors are just delivering slop: Why bother with quality? Why bother with the craft? When it will be disrupted by the next tool/AI model/etc in the next few years anyway? Just think short term - will this slop get you through the PR and tick a short term box? If so success - might not have a job long term anyway due to all the AI stuff. In fact if I keep ticking boxes I'm more likely to last than the other person given more job incentives. Just get paid today.

In your example a QA that is skilled at testing websites should pick up CORS issues for example. And the models will keep getting better and eventually give them harnesses too - and we SWE will slowly automate everything around this because the only lifeboat left for your career is to cash out hopefully by disrupting yourself (no unions, professional bodies, etc).

weatherlite · 2025-12-14T18:51:11 1765738271

> Israelians

Israelis

weatherlite · 2025-12-03T09:56:06 1764755766

> AI agents break rules under everyday pressure

Jeez they really ARE becoming human like

alentred · 2025-12-03T10:11:22 1764756682

LLMs are built based on human language and texts produced by people, and imitate the same exact reasoning patterns that exist in the training data. Sorry for being direct, but this is literally unsurprising. I think it is important to realize it to not anthropomorphize LLM / AI - strictly speaking they do not *become* anything.

IAmGraydon · 2025-12-03T11:26:00 1764761160

Exact same thought I had. More AI pumping BS.

weatherlite · 2025-12-02T18:51:01 1764701461

> I feel the effects of this are going to take a while to be felt (5 years?);

Who knows if we'll even need senior devs in 5 years. We'll see what happens. I think the role of software development will change so much those years of technical experience as a senior won't be so relevant but that's just my 5 cents.

giancarlostoro · 2025-12-02T18:54:40 1764701680

The way I'm using claude code for personal projects, I feel like most devs will become moreso architects and testers of the output, and reviewers of the output. Which is good, plenty of us have said for ages, devs dont read code enough. Well now you get to read it. ;)

While the work seems to take similar amounts of time, I spend drastically less time fixing bugs, bugs that take me days or God forbid weeks, solved in minutes usually, sometimes maybe an hour if its obscure enough. You just have to feed the model enough context, full stack trace, every time.

tenacious_tuna · 2025-12-02T18:59:27 1764701967

> Well now you get to read it.

Man, I wish this was true. I've given the same feedback on a colleague's clearly LLM-generated PRs. Initially I put effort into explaining why I was flagging the issues, now I just tag them with a sadface and my colleague replies "oh, cursor forgot." Clearly he isn't reading the PRs before they make it to me; so long as it's past lint and our test suite he just sends the PR.

I'd worry less if the LLMs weren't prone to modifying the preconditions of the test whenever they fail such that the tests get neutered, rather than correctly resolving the logic issues.

HaroldCindy · 2025-12-02T19:05:04 1764702304

We need to develop new etiquette around submitting AI-generated code for review. Using AI for code generation is one thing, but asking other people review something that you neither wrote nor read is inconsiderate of their time.

daheza · 2025-12-02T19:55:49 1764705349

I'm getting AI generated product requirements that they haven't read themselves. It is so frustrating. Random requirements like "this service must have a response time of 5s or less" - "A retry mechanism must be present". We have a specific SLA already for response time and the designs don't have a retry mechanism built.

The bad product managers have become 10x worse because they just generate AI garbage to spray at the engineering team. We are now writing AI review process for our user stories to counter the AI generation of the product team. I'd much rather spend my time building things than having AI wars between teams.

HaroldCindy · 2025-12-03T01:42:14 1764726134

Oof. My general principle is "sending AI-authored prose to another human without at least editing it is rude". Getting an AI-generated message from someone at all feels rude to me, kind of like an extreme version of "dictated but not read" being in a letter in the old days.

_keats · 2025-12-02T23:23:08 1764717788

Wow, this describes _exactly_ what I've started to see from some PMs.

icedchai · 2025-12-02T23:31:35 1764718295

At least they're running the test suite? I'm working with guys who don't even do that! I've also heard "I've fixed the tests" only to discover, yes, the tests pass now, but the behavior is no longer correct...

GuinansEyebrows · 2025-12-02T19:05:46 1764702346

> I feel like most devs will become moreso architects and testers of the output

which means either devs will take over architectural roles (which already exist and are filled) or architects will take over dev roles. same goes for testing/QA - these are already positions within the industry in addition to being hats that we sometimes put on out of necessity or personal interest.

QuercusMax · 2025-12-02T20:15:49 1764706549

I've seen Product Manager / Technical Program Manager types leaning into using AI to research what's involved in a solution, or even fix small bugs themselves. Many of these people have significant software experience already.

This is mostly a good thing provided you have a clear separation between solution exploration and actually shipping software - as the extra work put into productionizing a solution may not be obvious or familiar to someone who can use AI to identify a bugfix candidate, but might not know how we go about doing pre-release verification.

weatherlite · 2025-12-02T19:10:55 1764702655

> I feel like most devs will become moreso architects and testers of the output

Which stands to reason you'll need less of them. I'm really hoping this somehow leads to an explosion of new companies being built and hiring workers , otherwise - not good for us.

phantasmish · 2025-12-02T19:55:11 1764705311

> Which stands to reason you'll need less of them.

Depends on how much demand there would be for somewhat-cheaper software. Human hours taken could well remain the same.

Also depends on whether this approach leads to a whole lot of badly-fucked projects that companies can’t do without and have to hire human teams to fix…

jackschultz · 2025-12-02T19:13:14 1764702794

This is what I'm doing, Opus 4.5 for personal projects and to learn the flow and what's needed. Only thing I'll disagree with is how the work takes similar amount of time because I'm finding it unbelievably faster. It's crazy how with smart planning and documentation that we can do with the agents, getting markdown files etc, they can write the code better and faster than I can as a senior dev. No question.

I've found Opus 4.5 as a big upgrade compared to any of the other models. Big step up and the minor issues that were annoying and I needed to watch out for with Sonnet and GPT5.1.

It's to the point where I'm on the side of, if the models are offline or I run out of tokens for the 5 hour window or the week (with what I'm paying now), there's kind of no use of doing work. I can use other models to do planning or some review, but then wait until I'm back with Opus 4.5 to do the code.

It still absolutely requires review from me and planning before writing the code, and this is why there can be some slop that goes by, but it's the same as if you have a junior and they put in weak PRs. Difference is much quicker planning which the models help with, better implementation with basic conventions compared to juniors, and much easier to tell a model to make changes compared to a human.

giancarlostoro · 2025-12-02T19:23:50 1764703430

> This is what I'm doing, Opus 4.5 for personal projects and to learn the flow and what's needed. Only thing I'll disagree with is how the work takes similar amount of time because I'm finding it unbelievably faster.

I guess it depends on the project type, in some cases like you're saying way faster. I definitely recognize I've shaved weeks off a project, and I get really nuanced and Claude just updates and adjusts.

samdoesnothing · 2025-12-02T20:14:03 1764706443

Can you post a repo so we can see what it's generating?

weatherlite · 2025-12-02T09:15:58 1764666958

I'm impressed by this. You know in the beginning I was like hey why doesn't this look like counterstrike ? yeah I had the exepectation this things can one shot an industry leading computer game. Of course that's not yet possible. But still, this is pretty damn impressive for me.

Ragnarork · 2025-12-02T10:40:18 1764672018

In a way, they really condensed perfectly a lot of what's silly currently around AI.

> Codex, Opus, Gemini try to build Counter Strike

Even though the prompt mentions Counter Strike, it actually asks to build the basics of a generic FPS, and with a few iterations ends up with some sort of minecraft-looking generic FPS with code that would never make it to prod anywhere sane.

It's technically impressive. But functionally very dubious (and not at all anything remotely close to Counter-Strike besides "being an FPS").

Fitting.

gafferongames · 2025-12-02T15:12:40 1764688360

It's not even technically impressive to anybody who has worked on first person shooters. It's literally trash.

weatherlite · 2025-11-28T17:20:42 1764350442

Well the same can be said about non contrarians ...

enknee1 · 2025-11-29T14:40:51 1764427251

The same can be said about hucksters of all stripes, yes.

But maybe not contrarians/non-contrarians? They are just the agree/disagree commentators. And much of the most valuable commentary is nuanced with support for and against their own position. But generally for.

weatherlite · 2025-11-27T13:55:05 1764251705

Ah great more "when will we hit AGI" speculation, lets keep them coming. Some say 2 years, some say 5, some say never.

weatherlite · 2025-11-26T08:46:52 1764146812

> Even if the models don't get any smarter, just give it a few more years and we'll see a strong impact. We're just starting to figure things out.

2 years ? 15 years ? It matters a lot for people, the stock market and governments