> AI coding assistants bias towards assuming the code they're working with is co...

owebmaster · 2025-05-13T18:06:05 1747159565

> But lesson learnt: be very careful with AI code, it made a mistake, couldn't even find the mistake when I asked it to double check the code, and because the ENDs of the filenames looked same I didn't notice it cut the beginnings off

Don't test code in production.

Good software engineering practices didn't change with AI, they actually are even more important. levelsio is a quite successful entrepreneur but he is not an engineer.

phillipcarter · 2025-05-13T18:29:49 1747160989

Moreover, he's also not a good person to look for at how to apply AI! He picks the simplest possible thing to build with an extremely narrow focus to maximize revenue and minimize work. It's precisely the right way to analyze tradeoffs in his shoes as a solo entrepreneur. But I would imagine that few of us who work for larger organizations would apply a similar mindset to software development.

That said, we all test in production, it's just a question of how deliberate and principled we are about it :D

phillipcarter · 2025-05-13T15:49:07 1747151347

> That's why you should just write tests, before you write the code, so that you know what you are expecting with the code that is under test is doing. i.e Test driven development.

I've tried this too. They find ways to cheat the tests, sometimes throwing in special cases that match the specific test cases. It's easy to catch in the small scale but not when in a larger coding session.

> No. Please do not do this. These LLMs have zero understanding / reasoning about the code they are outputting.

This is incorrect. LLMs do have the ability to reason, but it's not the same reasoning that you or I do. They are actually quite good at checking for a variety of problems, like if the code you're writing is sensitive to memory pressure and you want to account for it. Asking them to examine the code with several constraints in mind often does give reasonable advice and suggestions to change. But this requires you to understand those changes to be effective.

theshrike79 · 2025-05-13T21:02:18 1747170138

I had a case where Claude 3.7 mocked a test so far it wasn't actually testing anything but the mocks :D