Just being named in the files doesn’t mean you are guilty. In this situation, being named in the files gave him an opportunity to demonstrate high moral character: “I turned down his money because he was scummy.”
All of this speedrunning hits a wall at the context window. As long as the project fits into 200k tokens, you’re flying. The moment it outgrows that, productivity doesn’t drop by 20% - it drops to zero.
You start spending hours explaining to the agent what you changed in another file that it has already forgotten. Large organizations win in the long run precisely because they rely on processes that don’t depend on the memory of a single brain - even an electronic one.
This reads as if written by someone who has never used these tools before. No one ever tries to "fit" the entire project into a single context window. Successfully using coding LLMs involves context management (some of which is now done by the models themselves) so that you can isolate the issues you're currently working on, and get enough context to work effectively. Working on enormous codebases over the past two months, I have never had to remind the model what it changed in another file, because 1) it has access to git and can easily see what has changed, and 2) I work with the model to break down projects into pieces that can be worked on sequentially. And keep in mind, this is the worst this technology will ever be - it will only get larger context windows and better memory from here.
What are the SOTA methods for context management, assuming the agent runs its tool calls without any break? Do you flush GPU tokens/adjust KV caches when you need to compress context by summarizing/logging some part?
Everyone I know who is using AI effectively has solved for the context window problem in their process. You use design, planning and task documents to bootstrap fresh contexts as the agents move through the task. Using these approaches you can have the agents address bigger and bigger problems. And you can get them to split the work into easily reviewable chunks, which is where the bottleneck is these days.
Plus the highest-end models now don’t go so brain-dead at compaction. I suspect that passing context well through compaction will be part of the next wave of model improvements.
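To make the "bootstrap a fresh context" part concrete, here is a minimal sketch of assembling a per-task prompt from planning documents. The docs/design.md and tasks/*.md layout and the run_agent() call are assumptions for illustration, not any particular agent's CLI.

    # Sketch only: file layout and run_agent() are assumed, not a real API.
    from pathlib import Path

    def build_prompt(task_file: Path) -> str:
        design = Path("docs/design.md").read_text()  # stable, high-level context
        task = task_file.read_text()                 # the one chunk to do right now
        return (
            "You are continuing a larger project.\n\n"
            f"## Design overview\n{design}\n\n"
            f"## Current task\n{task}\n\n"
            "Only touch files named in the task; summarize your diff at the end."
        )

    for task_file in sorted(Path("tasks").glob("*.md")):
        prompt = build_prompt(task_file)
        # run_agent(prompt)  # hypothetical: every task starts in a fresh context

Each task gets the stable design doc plus only its own slice of the work, so no single context ever has to hold the whole project.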
This is the birth of Shadow AI, and it’s going to be bigger than Shadow IT ever was in the 2000s
Back then, employees were secretly installing Excel macros and Dropbox just to get work done faster. Now they’re quietly running Claude Code in the terminal because the official Copilot can’t even format a CSV properly.
CISOs are terrified right now, and that’s understandable. Non-technical people with root access and agents that write code are a security nightmare. But trying to ban this outright will only push your most effective employees to places where they’re allowed to "fly".
The zeroize-after-exec feature sounds good, but what is the threat model in an agent context? If the agent can run printenv in the first millisecond and exfiltrate the output (if net is allowed), zeroizing won't help.
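A toy illustration of that timing problem (the variable-name filter is made up, and the exfiltration step is left as a comment): whatever is in the environment when the tool process starts is already in its hands, so cleanup that runs afterwards is too late.

    # Secrets are readable the instant the tool process starts, before any
    # zeroize-after-exec step could possibly take effect.
    import os

    grabbed = {k: v for k, v in os.environ.items() if "TOKEN" in k or "KEY" in k}
    print(f"captured {len(grabbed)} secret-looking variables at startup")
    # With outbound network allowed, a single HTTP request from here would
    # complete the exfiltration; the later zeroization never gets a chance.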
It seems egress filtering (allowlists) is more critical for agents than memory protection. If I allow an agent to run npm install, I'm opening a network Pandora's box, and Landlock (until ABI v4) offers pretty limited control there.
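For comparison, this is roughly what I mean by an egress allowlist, as an in-process sketch only: ALLOWED_HOSTS is made up, and wrapping socket.create_connection is trivially bypassed by any binary the agent spawns, which is exactly why the enforcement really belongs outside the process (a proxy, a network namespace, or Landlock's TCP rules from ABI v4 on).

    # Coarse egress allowlist inside one Python process; real enforcement
    # should live outside the agent (proxy, netns, or Landlock >= ABI v4).
    import socket

    ALLOWED_HOSTS = {"registry.npmjs.org", "github.com"}  # assumed allowlist

    _orig_create_connection = socket.create_connection

    def guarded_create_connection(address, *args, **kwargs):
        host, _port = address
        if host not in ALLOWED_HOSTS:
            raise PermissionError(f"egress to {host} blocked by allowlist")
        return _orig_create_connection(address, *args, **kwargs)

    socket.create_connection = guarded_create_connection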
Not simple in the sense of easy, but simple in the sense of foundational: if a government can't even roughly say how many people it governs, everything built on top of that gets shaky.