The beginning of the article was good, but the analysis of DeepSeek and what it means for Nvidia is confused and clearly out of the loop.
* People have been training models at below-fp32 precision for many years; I did this in 2021, and it was already easy in all the major libraries.
* GPU FLOPs are used for many things besides training the final released model.
* Demand for AI is capacity-limited, so it is possible, even likely, that increasing the AI delivered per FLOP would not substantially reduce the price of GPUs.
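For the first bullet: the standard recipe behind below-fp32 training is to do the heavy math in a low-precision type while keeping a full-precision "master copy" of the weights, so small updates aren't lost to rounding. Here is a minimal toy sketch of that idea in numpy (an illustration of the technique, not any particular library's API):

```python
import numpy as np

# Toy mixed-precision training loop: compute in float16, but keep a
# float32 "master copy" of the weights so small updates survive rounding.

rng = np.random.default_rng(0)
n, d = 256, 8
X = rng.normal(size=(n, d)).astype(np.float16)
true_w = np.ones(d, dtype=np.float32)
y = (X.astype(np.float32) @ true_w).astype(np.float16)

w_master = np.zeros(d, dtype=np.float32)  # fp32 master weights
lr = 0.1

for _ in range(300):
    w16 = w_master.astype(np.float16)           # low-precision copy for compute
    err = X @ w16 - y                           # fp16 forward pass
    grad16 = X.T @ err / np.float16(n)          # fp16 "backward" pass
    w_master -= lr * grad16.astype(np.float32)  # update happens in fp32;
                                                # real frameworks also add loss
                                                # scaling so tiny fp16 gradients
                                                # don't underflow to zero

print(np.allclose(w_master, true_w, atol=0.05))
```

In major frameworks this whole pattern is a one- or two-line switch (e.g. automatic mixed precision in PyTorch), which is why "they trained in low precision" is not by itself evidence of some novel capability.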
His DeepSeek argument was essentially that the people who actually understand the economics of running these teams (e.g. the engineers themselves) are looking at DeepSeek's claims and are genuinely awestruck.
Where does this "capacity" limit come from? I can get as many H100s from GCP or wherever as I wish; the only thing that is capacity-limited is 100k-GPU clusters à la Elon/X. But what DeepSeek (and the recent evidence of a limit to pure base-model scaling) shows is that those might actually not be profitable, and we may end up with much smaller base models scaled at inference time. Nvidia's moat in inference-time scaling is much smaller, and you don't need the humongous clusters for it either: you can just distribute the inference (and, in the future, run it locally too).