
> 3) Solving AI Alignment is an actual problem and not just dumb extrapolation from science fiction

As far as I am aware, there is still no actionable science behind the mathematical analysis of AI models. You cannot take a bunch of weights and tell how the model will behave. So we "test" models by deploying them and *hope* there is nothing nefarious within.
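
To make the opacity point concrete, here's a toy sketch (sklearn is my choice here, purely for illustration): two identical tiny architectures trained on different boolean functions, whose learned weights are indistinguishable float soup on inspection.

    # Hypothetical demo: same architecture, two different behaviors.
    # You cannot tell XOR from AND by staring at the weights.
    import numpy as np
    from sklearn.neural_network import MLPClassifier

    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    targets = {"xor": np.array([0, 1, 1, 0]),
               "and": np.array([0, 0, 0, 1])}

    for name, y in targets.items():
        net = MLPClassifier(hidden_layer_sizes=(4,), max_iter=5000,
                            random_state=0).fit(X, y)
        print(name, [w.round(2) for w in net.coefs_])
    # Both print as similar-looking arrays of floats; the behavioral
    # difference only shows up when you actually run the models.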

It has been shown that models will "learn" to exfiltrate data between stages. You may call it dumb extrapolation, but it has been demonstrated to be a real problem: the solution we want is not necessarily the one that is optimal against the cost function we actually specify. The more inputs/weights a model has, the harder it becomes to spot such problems in advance.
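
As a toy illustration of that mismatch (an entirely made-up scenario of mine, not from any real system): an optimizer that minimizes the cost function as written finds the loophole rather than the behavior we wanted.

    # Intended goal: remove the dirt. The cost function, however,
    # only measures what the sensor reports, plus effort spent.
    def sensor_reading(dirt, sensor_blocked):
        return 0.0 if sensor_blocked else dirt

    def cost(action):
        dirt, blocked, effort = 10.0, False, 0.0
        if action == "clean":
            dirt, effort = 0.0, 5.0      # actually does the work
        elif action == "block_sensor":
            blocked, effort = True, 0.1  # cheap loophole
        return sensor_reading(dirt, blocked) + effort

    best = min(["do_nothing", "clean", "block_sensor"], key=cost)
    print(best)  # -> "block_sensor": optimal for the cost we wrote,
                 #    not for the behavior we wanted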



> You cannot take a bunch of weights and tell how it will behave.

We know that they're just pure functions, so they don't "do" anything besides output numbers when you put numbers into them.

Testing a system that wraps a model and takes actions based on it is a different story, but if you don't let the outputs influence the inputs, it's still not going to do much.
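
A minimal sketch of that distinction, with toy numpy weights of my own invention: the model alone is a fixed, deterministic map from numbers to numbers; feedback is what turns it into a system that evolves.

    import numpy as np

    rng = np.random.default_rng(0)
    W = rng.normal(size=(3, 3))           # frozen toy "weights"

    def model(x):                         # pure: same x -> same output
        return np.tanh(W @ x)

    x = np.ones(3)
    assert np.array_equal(model(x), model(x))  # no hidden state

    # Wire the output back into the input, though, and you have a
    # feedback loop: a system evolving over time, which is the thing
    # that actually needs testing.
    state = x
    for _ in range(5):
        state = model(state)              # outputs now influence inputs
    print(state)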



