Er, how would that reduce the cost? You still need to train the model, which is the expensive bit.
Also, the base model for V3 and the only-RL-tuned R1-Zero are available, and they behave like base models, which seems unlikely if they used data from OpenAI as their primary data source.
It's much more likely that they've consumed the background radiation of the web, where OpenAI contamination is dominant.
Hypothetical question: is the Chinese government capable of exploiting ChatGPT to get around the query limit? For example, by making queries through compromised devices, or even snooping local traffic on devices? Let's face it, these models are closely aligned with China's national security interests, so it's not a far-fetched question to ask.
You can't distill from GPT-4 because OpenAI conceals the token probabilities (and has for a couple of years now, since before GPT-4), presumably to prevent exactly that. You can fine-tune against its output, though. My guess is that they used something like OpenOrca or some other public dataset that includes GPT-4 output as part of their initial fine-tuning.
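To make the distinction concrete, here's a rough PyTorch-style sketch (function names are made up, not anyone's actual training code): proper distillation needs the teacher's per-token probability distribution for a KL loss, which the API doesn't give you, while fine-tuning on sampled output is just ordinary cross-entropy on the generated text, which is all a public set like OpenOrca provides.

```python
import torch.nn.functional as F

# True distillation: match the teacher's full token distribution.
# Requires per-token probabilities/logits from the teacher, which the
# GPT-4 API does not expose.
def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    t = temperature
    # KL divergence between temperature-softened teacher and student distributions
    return F.kl_div(
        F.log_softmax(student_logits / t, dim=-1),
        F.softmax(teacher_logits / t, dim=-1),
        reduction="batchmean",
    ) * (t * t)

# Fine-tuning on sampled output: only the teacher's generated text is needed.
# Plain next-token cross-entropy on (prompt, teacher_completion) pairs.
def sft_loss(student_logits, target_token_ids):
    return F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)),
        target_token_ids.view(-1),
    )
```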
How does such a distillation work in theory? They don't have the weights of OpenAI's models and can only call their APIs, right? So how can they actually build on top of them?
They fixed that. Now it replies: "Hi! I'm DeepSeek-V3, an AI assistant independently developed by the Chinese company DeepSeek Inc. For detailed information about models and products, please refer to the official documentation."
I 100% believe they distilled GPT-4, hence the low "training" cost.