More

kjok · 2026-01-29T20:59:14 1769720354

Maybe humans can focus on cybersecurity and fraud? That’s not going away with AI

kjok · 2026-01-29T07:53:56 1769673236

Block based on cookies (i.e., set a cookie on the browser and check on the server whether it exists).

direwolf20 · 2026-01-29T14:02:18 1769695338

This project implements a variety of similar JSless checks, such as image loading

https://github.com/WeebDataHoarder/go-away

bigbadfeline · 2026-01-30T18:39:43 1769798383

That helps, but the big bot farms use clients that support cookies, we need to add more defenses on top of them.

kjok · 2026-01-21T16:05:43 1769011543

Genuine question: why is everyone rolling out their own sandbox wrappers around VMs/Docker for agents?

borenstein · 2026-01-21T16:11:05 1769011865

I know, right? The day I initially thought about posting this, there was another one called `yolo-box`. (That attempt--my very first post--got me instantly shadow-banned due to being on a VPN, which led to an unexpected conversation with @dang, which led to some improvements, which led to it being a week later.)

I think it's the convergence of two things. First, the agents themselves make it easier to get exactly what you want; and second, the OEM solutions to these things really, really aren't good enough. CC Cloud and Codex are sort of like this, except they're opaque and locked down, and they work for you or they don't.

It reminds me a fair bit of 3D printer modding, but with higher stakes.

catlifeonmars · 2026-01-21T16:19:27 1769012367

Because of findings like this

https://www.anthropic.com/research/small-samples-poison

(A small number of samples can poison LLMs of any size) to save clicks to read the headline

The way I think of it is, coding agents are power tools. They can be incredibly useful, but can also wreak a lot of havoc. Anthropic (et al) is marketing them to beginners and inevitably someone is going to lose their fingers.

kjok · 2026-01-21T17:18:00 1769015880

I understand the need, but I don't understand why a VM or Docker is not enough. Why are people creating custom wrappers around VMs/containers?

borenstein · 2026-01-21T17:27:18 1769016438

Docker isn't virtualization; it's not that hard to infiltrate the underlying system if you really want to. But as for VMs--they are enough! They're also a lot of boilerplate to set up, manage, and interact with. yolo-cage is that boilerplate.

dist-epoch · 2026-01-21T17:55:07 1769018107

Because people want to run agents in yolo mode without worrying that it's going to delete the whole computer.

And once you put the agent in a VM/container it's much easier to run 10 of them in parallel without mutual interference.

borenstein · 2026-01-21T18:14:01 1769019241

On that note, yolo-cage is pretty heavyweight. There are much lighter tools if your main concern is "don't nuke my laptop." yolo-box was trending on HN last week: https://news.ycombinator.com/item?id=46592344

derpsteb · 2026-01-21T16:12:44 1769011964

My experience is that neither has a good UX for what I usually try to do with coding agents. The main problem I see is setup/teardown of the boxes and managing tools inside them.

m-hodges · 2026-01-21T16:12:36 1769011956

It all feels like temporary workflow fixes until The Agent Companies just ship their opinionated good enough way to do it.

borenstein · 2026-01-21T16:22:15 1769012535

It probably is. Some of this stuff will hang around because power users want control. Some of it will evolve into more sophisticated solutions that get turned into products and become easier to acquihire than the build in house. A lot of it will become obsolete when the OEMs crib the concept. But IMO all of those are acceptable outcomes if what you really want is the thing itself.

odie5533 · 2026-01-21T19:30:26 1769023826

They've already suggested using Dev Containers. https://code.claude.com/docs/en/devcontainer

kjok · 2025-11-10T18:30:24 1762799424

If not today, models will get better at solving captchas in the near future. IMHO, the real concern, however, is cheap captcha solving services.

arbol · 2025-11-10T19:25:54 1762802754

The solvers are a problem but they give themselves away when they incorrectly fake devices or run out of context. I run a bot detection SaaS and we've had some success blocking them. Their advertised solve times are also wildly inaccurate. They take ages to return a successful token, if at all. The number of companies providing bot mitigation is also growing rapidly, making it difficult for the solvers to stay on top of reverse engineering etc.

kjok · 2025-11-10T19:38:35 1762803515

> when they incorrectly fake devices

And how often does this happen? Do you have any proof? Most YC companies building browser agents have built-in captcha solvers.

arbol · 2025-11-10T21:23:18 1762809798

That's a good question. I haven't checked the stats to see how often it happens but I will make a note to return with some info. We're dealing with the entire internet, not just YC companies, and many scrapers / solvers will pass up a user agent that doesn't quite match the JS capabilities you would expect of the browser version. Some solving companies allow you to pass up user agent , which causes inconsistencies as they're not changing their stack to match the user agent you supply. Under the hood they're running whatever version of headless Chrome they're currently pinned to.

kjok · 2025-09-29T19:40:35 1759174835

What is so fundamentally different for AI agents?

rvz · 2025-09-29T20:27:42 1759177662

Other than the current popular thing which is "AI agents", like all programs, it changes absolutely nothing.

Yoric · 2025-09-29T20:39:05 1759178345

The fact that the first thing people are going to do is punch holes in the sandbox with MCP servers?

kjok · 2025-09-29T19:34:02 1759174442

Ah ha.

kjok · 2025-09-18T16:01:34 1758211294

> automated scanners seem to do a good job already of finding malicious packages.

That's not true. This latest incident was detected by an individual researcher, just like many similar attacks in the past. Time and again, it's been people who flagged these issues, later reported to security startups, not automated tools. Don't fall for the PR spin.

If automated scanning were truly effective, we'd see deployments across all major package registries. The reality is, these systems still miss what vigilant humans catch.

kelnos · 2025-09-19T00:14:41 1758240881

> This latest incident was detected by an individual researcher

So that still seems fine? Presumably researchers are focusing on latest releases, and so their work would not be impacted by other people using this new pnpm option.

hobofan · 2025-09-18T16:26:17 1758212777

> If automated scanning were truly effective, we'd see deployments across all major package registries.

No we wouldn't. Most package registries are run by either bigcorps at a loss or by community maintainers (with bigcorps again sponsoring the infrastructure).

And many of them barely go beyond the "CRUD" of package publishing due to lack of resources. The economic incentives of building up supply chain security tools into the package registries themselves are just not there.

kjok · 2025-09-18T17:13:46 1758215626

You're right that registries are under-resourced. But, if automated malware scanning actually worked, we'd already see big tech partnering with package registries to run continuous, ecosystem-wide scanning and detection pipelines. However, that isn't happening. Instead, we see piecemeal efforts from Google with assurance artifacts (SLSA provenance, SBOMs, verifiable builds), Microsoft sponsoring OSS maintainers, Facebook donating to package registries. Google's initiatives stop short of claiming they can automatically detect malware.

This distinction matters. Malware detection is, in the general case, an undecidable problem (think halting problem and Rice theorem). No amount of static or dynamic scanning can guarantee catching malicious logic in arbitrary code. At best, scanners detect known signatures, patterns, or anomalies. They can't prove absence of malicious behavior.

So the reality is: if Google's assurance artifacts stop short of claiming automated malware detection is feasible, it's a stretch for anyone else to suggest registries could achieve it "if they just had more resources." The problem space itself is the blocker, not just lack of infra or resources.

motorest · 2025-09-19T06:26:06 1758263166

> But, if automated malware scanning actually worked, we'd already see big tech partnering with package registries to run continuous, ecosystem-wide scanning and detection pipelines.

I think this sort of thought process is misguided.

We do see continuous, ecosystem-wide scanning and detection pipelines. For example, GitHub does support DependaBot, which runs supply chain checks.

https://github.com/dependabot

What you don't see is magical rabbits being pulled out of top hats. The industry has decades of experience with anti-malware tools in contexts where said malware runs in spite of not being explicitly provided deployment or execution permissions. And yet it deploys and runs. What do you expect if you make code intentionally installable and deployable, and capable of sending HTTP requests to send and receive any kind of data?

Contrary to what you are implying, this is not a simple problem with straight-forward solutions. The security model has been highly reliant on the role of gatekeepers, both in producer and consumer sides. However, the last batch of popular supply chain attacks circumvented the only failsafe in place. Beyond this point, you just have a module that runs unspecified code, just like any other module.

anematode · 2025-09-18T18:17:16 1758219436

The latest incident was detected first by an individual researcher (haven't verified this myself, but trusting you here) -- or maybe s/he was just the fastest reporter in the west. Even simple heuristics like the sudden addition of high-entropy code would have caught the most recent attacks, and obviously there are much better methods too.

kjok · 2025-09-18T15:54:49 1758210889

I think there's some confusion here. No automated scan was able to catch the attack. It was an individual who notified these startups.

acdha · 2025-09-18T17:15:50 1758215750

Quite possibly - there have been several incidents recently and a number of researchers working together so it’s not clear exactly who found something first and it’s definitely not as simple to fix as tossing a tool in place.

The CEO of socket.dev described an automated pipeline flagging new uploads for analysts, for example, which is good but not instantaneous:

https://news.ycombinator.com/item?id=45257681

The Aikido team also appear to be suggesting they investigated a suspicious flag (apologies if I’m misreading their post), which again needs time for analysts to work:

https://www.aikido.dev/blog/npm-debug-and-chalk-packages-com...

My thought was simply that these were caught relatively quickly by security researchers rather than by compromised users reporting breaches. If you didn’t install updates with a relatively short period of time after they were published, the subsequent response would keep you safe. Obviously that’s not perfect and a sophisticated, patient attack like liblzma suffered would likely still be possible but there really does seem to be a value to having something like Debian’s unstable/stable divide where researchers and thrill-seekers would get everything ASAP but most people would give it some time to be tested. What I’d really like to see is a community model for funding that and especially supporting independent researchers.

kjok · 2025-09-16T21:02:49 1758056569

> managed pull-through caches implemented via tools like Artifactory

This is why package malware creates news, but enterprises mirroring package registries do not get affected. Building a mirroring solution will be pricey though mainly due to high egress bandwidth cost from Cloud providers.

kjok · 2025-09-16T16:35:22 1758040522

Why should MS buy any of these startups when a developer (not any automated tech) found the malware? It looks like these startups did after-the-fact analysis for PR.

singulasar · 2025-09-17T10:29:26 1758104966

on the other hand, the previous supply chain attack was found by automated tech. Also, if MS would be so kind as to just run similar scans at the time a package is updated instead of after the package is updated (which is the only way the automated tech can run if npm doesn't integrate it), then malware like this would be way less common.

MS doesn't care

kjok · 2025-09-18T15:55:12 1758210912

> on the other hand, the previous supply chain attack was found by automated tech.

Are you sure about this? Would love to see which ones.

singulasar · 2025-09-18T16:59:10 1758214750

The chalk/debug one https://www.aikido.dev/blog/npm-debug-and-chalk-packages-com... I believe socket also found it this way just a bit later.

The dev later said that Charlie notifying him probably shaved off some very important time for the remediation.

So in this case 2 different companies found it using automated tech before anyone else

CharlieEriksen · 2025-09-19T07:16:00 1758266160

Hi, I'm Charlie from Aikido, as mentioned above. Yes, we detected it automatically, and I alerted Josh to the situation on BSky.

There's no reason why Microsoft/npm can't do what we're doing, or any of the other handful to dozen companies that do similar things to us, to protect the supply chain.