More

robrenaud · 2026-06-11T00:16:29 1781136989

They use a lightweight adapter to silently degrade the performance. Usually these adaptors are made to improve the performance for a given domain/task.

robrenaud · 2026-06-04T15:56:31 1780588591

3 blue 1 brown has a great visual introduction to transformers, the heart of LLMs.

It's chapter 5. Start at chapter 1 if you want more background on neural nets and backprop.

https://youtu.be/wjZofJX0v4M?si=HFXbrB-5cArprGaU

robrenaud · 2026-06-04T15:48:01 1780588081

"The reasoning is the weights."

The reasoning is in a process that uses the weights.

Sorting algorithms are just bytes. Those bytes don't sort by themselves. They do instruct a computer on how to sort though.

robrenaud · 2026-05-11T14:04:08 1778508248

There is some recent work on modularizing knowledge in LLMs.

https://arxiv.org/html/2605.06663v1

It might be possible to train a big generalist that is a composition of modules, some of which can be dropped dynamically at inference time, depending on the prompt.

digitaltrees · 2026-05-17T05:23:32 1778995412

Cool. Thanks for sharing. I am thinking about creating a series of smaller models for specific purposes and then orchestrating them so they mirror the human brain which is a bunch of subsystems that give multiple opinions about the same stimulus

shailendra_sis · 2026-05-17T05:34:30 1778996070

Interesting direction. I’ve also been thinking about modular / subsystem-based approaches for specialized tasks in small AI systems.

robrenaud · 2026-04-28T18:12:06 1777399926

Is every American tax payer morally compromised?

eks391 · 2026-04-28T19:07:41 1777403261

Yes ;)

I agree with the intent of your rhetorical question, so I'm jesting with you. I'm justifying my "yes" with the hopefully humorous distraction that every person, including American taxpayers, has at some point made a nonsustainable/selfish (my definition of immoral) decision.

robrenaud · 2026-04-27T20:05:47 1777320347

My big gripe with unions is the unwavering protection of their worst performing members.

Eg, that they necessitated so called "rubber rooms" like these in the NYC public schools, where teachers got paid to do nothing while waiting on arbitration.

https://en.wikipedia.org/wiki/Reassignment_center

threetonesun · 2026-04-27T20:12:55 1777320775

I doubt you'll find many people in favor of how bad cops get protected by police unions either. At least in the US I'd much rather a broad social net so my health care and retirement weren't so directly tied to my job than a union specific to my trade.

robrenaud · 2026-04-20T15:17:42 1776698262

The flat earthers are why I hate astronomy.

Afaict, the grand parent poster is just very wrong. You do want to cause acute stresses to your heart (cardiovascular exercise) to get it work better.

groundzeros2015 · 2026-04-20T15:20:05 1776698405

It’s not really about this particular claim. It’s that I can read a comment that has a reasonable chain of logic and I don’t know if it’s true. This topic is just not easily studied and theories are hard to falsify.

groundzeros2015 · 2026-04-20T19:33:03 1776713583

Claims about flat earth are falsifiable with at-home experiment.

robrenaud · 2026-04-16T21:48:24 1776376104

Yeah, it's different. Anthropic profits when it delivers tokens. Hosting providers pay when Anthropic scrapes them.

robrenaud · 2026-04-01T05:57:30 1775023050

Yeah, my big problem with the paper is it just might be an artifact of qwen's training process.

taneq · 2026-04-01T10:39:27 1775039967

In all fairness most of the unique stuff I can do is probably an artifact of my training process, so it seems unfair to deny an LLM the same accomodation.

nativeit · 2026-04-01T13:25:42 1775049942

How much did your training cost society?

msdz · 2026-04-01T16:24:06 1775060646

This got me thinking, and it might actually even be a comparable amount. Let's estimate 12 years of schooling run at minimum $100,000 per student, at least in the US [1], and then add onto that number whatever else you may do after that, i.e. a bunch more money if paid (college) or "unpaid" (self-taught skills and improvements) education, and then the likely biggest portion for white-collar workers, yet hard-to-quantify, in experience and "value" professional work will equip one with.

Now divide the average SOTA LLM's training cost (or a guess, since these numbers aren't always published as far as I'm aware) by the number of users, or if you wanted to be more strict, the number of people it's proven to be useful for (what else would training be for), and it might not be so far off anymore?

Of course, whether it makes sense to divide and spread out the LLMs' costs across users in order to calculate an "average utility" is debatable.

[1] https://www.publicschoolreview.com/average-spending-student-...

robrenaud · 2026-03-10T15:36:17 1773156977

Was Alphago's move 37 original?

In the last step of training LLMs, reinforcement learning from verified rewards, LLMs are trained to maximize the probability of solving problems using their own output, depending on a reward signal akin to winning in Go. It's not just imitating human written text.

Fwiw, I agree that world models and some kind of learning from interacting with physical reality, rather than massive amounts of digitized gym environments is likely necessary for a breakthrough for AGI.