Hacker News

Looks like there is some quality reduction, but nonetheless 2s to generate a 5s video on a 5090 for WAN 2.1 is absolutely crazy. Excited to see more optimizations like this moving into 2026.




Efficient realtime video diffusion will revolutionize the way people use computers even more so than LLMs.

I actually think we are already there with quality, but nobody is going to wait 10 minutes to do a task with video that takes 2 seconds with text.

If Sora/Kling/whatever ran cool locally 24/7 at 60FPS, would anyone ever build a UI? Or a (traditional) OS?

I think it's worth watching the scaling graph.


> If Sora/Kling/whatever ran cool locally 24/7 at 60FPS, would anyone ever build a UI?

I like my buttons to stay where I left them.


Yeah, it’s like asking “why would anyone read a book today when LLMs can generate infinite streams of text?”

Those streams of text are often conditioned on the prompts: people are using them to learn about new concepts, and as a hyperpersonalised version of search. It can not only tell you about tools you didn't know existed, it can show you how to use them.

I do like my buttons to stay where I left them, but that can be conditioned too. Instead of GNOME "designers" telling me the button needs to be wide enough to hit with my left foot, I could tell the system I want this button small and in that corner, and add it to my prompt.


I feel like a lot of the above assumes the user knows what they want or what works best. I want an intelligent designer to figure out the best flow/story/narrative/game and create/present it, because I'm a dumb user who doesn't know what is actually good.

That's called a default. I'm happy for a GNOME designer to "design" the button large enough to hit with my foot while blindfolded, but I'd like the option to adjust it to my workflow rather than adjust my workflow to the button.

I suppose if one only reads self-help books of the “You’re the best, trust your instincts!” kind, then LLMs are an appropriate replacement.

Or indeed, if one has a mind of their own and wants a tool to obey them, rather than submit to the opinions of their "betters".

Please no, please no

That will be Windows 12, and perhaps iOS two generations from now :)


That’s not the actual wall-clock time if you run it; encoding and decoding are extra.

Nevertheless, it does seem that generation will fairly soon become fast enough to extend a video clip in realtime, autoregressively, second by second. Integrated with a multimodal input model, you would be very close to an extremely compelling AI avatar.
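A minimal sketch of that chunked autoregressive idea. Everything here is hypothetical: `generate_chunk` is a stub standing in for a real diffusion sampler, and a real system would also pay the VAE encode/decode cost mentioned above on top of the sampling loop.

```python
# Sketch: extend a video one second at a time, conditioning each new chunk
# on the trailing frames of what has been generated so far.
import numpy as np

FPS = 24
CHUNK_SECONDS = 1
CONTEXT_FRAMES = 8  # trailing frames fed back in as conditioning

def generate_chunk(context: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Placeholder for a video diffusion sampler: returns CHUNK_SECONDS * FPS
    new frames shaped like the context frames. A real model would denoise
    latents conditioned on `context`; we just emit noise frames."""
    h, w, c = context.shape[1:]
    return rng.standard_normal((CHUNK_SECONDS * FPS, h, w, c)).astype(np.float32)

def extend_video(video: np.ndarray, seconds: int) -> np.ndarray:
    rng = np.random.default_rng(0)
    for _ in range(seconds):
        context = video[-CONTEXT_FRAMES:]  # condition on the tail only
        video = np.concatenate([video, generate_chunk(context, rng)])
    return video

clip = np.zeros((FPS, 64, 64, 3), dtype=np.float32)  # 1-second seed clip
extended = extend_video(clip, seconds=4)
print(extended.shape[0] / FPS)  # total duration in seconds -> 5.0
```

The point of the fixed-size context window is that per-chunk cost stays constant no matter how long the video gets, which is what makes "autoregressive by the second" plausible in realtime once sampling a chunk takes under a second.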


