What's there not to understand? If it matches the latest GPT O-N model in perfor...

thechao · 2025-01-28T17:59:21 1738087161

New theory: this is a short-long play by the fund. They shorted NV, and now they're hoovering up stock, making their billions in the process from a small $50mm investment!

zombiwoof · 2025-01-28T18:16:20 1738088180

Bullseye

thechao · 2025-01-28T18:39:04 1738089544

Damn Matt Levine to hell! His latest newsletter has an entire section devoted to the entire topic!

infecto · 2025-01-28T12:33:32 1738067612

In my tests it does not come close to O1-Pro. Still huge news but it does not quite make it.

rallyforthesun · 2025-01-28T13:22:17 1738070537

The results I got from deepseek-r1 on their webpage did not match the results I got from o1-pro. I asked it to go to a GitHub repo, find the part where the logic of the “export” button is, and explain why it doesn’t work (the whole logic is actually missing, so it won’t work at all). o1-pro got it right on the first try, while deepseek-r1 was heavily hallucinating. Maybe I am using the wrong model?

throwup238 · 2025-01-28T14:30:30 1738074630

No, you’re not. They explicitly mention in the R1 paper (in the last paragraph before the bibliography) that R1 isn’t a “huge” improvement over DeepSeek-V3 in coding - where “huge” is an academic weasel word.

It’s just a lot of hype. In my coding tests it significantly underperforms o1 (haven’t tried o1-pro), often getting stuck in a reasoning loop because I underspecified something (that I don’t have to with o1).

infecto · 2025-01-28T15:50:41 1738079441

Same anecdotal experience. It's definitely an improvement, and they have made operational improvements at runtime, but I am still concerned they have overfit to the tests.