>Ge0rg3’s code is “open source,” in that anyone can copy it and reuse it non-commercially.
A bit of nit-picking, but that's not what open source means, especially as it relates to the GPL in this case. If you can't use the code commercially, it's neither "open source" (as defined by the OSI) nor free software (as defined by the FSF).
Llama 4 Scout is a 17B x 16-expert MoE, so 17B active parameters. That makes it faster to run, but the memory requirements are still large: every expert has to be loaded, not just the active ones. They claim it fits on an H100, so under 80GB. A Mac Studio with 96GB could run this; by "run" I mean inference, and Ollama is easy to use for that. 4x RTX 3090s would also work, but that's not the easiest PC build. The tinybox (https://tinygrad.org/#tinybox) is $15k and can do LoRA fine-tuning. You could also use a regular PC with 128GB of RAM, but it would be quite slow.
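For a rough sense of the memory math, here's a back-of-envelope sketch (the ~109B total-parameter figure is Meta's reported number for Scout; the overhead factor is my guess):

  # Rough VRAM/RAM estimate for Llama 4 Scout. With an MoE, all experts
  # must sit in memory even though only ~17B params are active per token.
  TOTAL_PARAMS = 109e9  # Meta's reported total (17B active, 16 experts, shared layers)

  def mem_gb(bytes_per_param, overhead=1.15):
      # weights plus ~15% for KV cache / activations (rough guess)
      return TOTAL_PARAMS * bytes_per_param * overhead / 1e9

  for name, bpp in [("fp16", 2.0), ("int8", 1.0), ("int4", 0.5)]:
      print(f"{name}: ~{mem_gb(bpp):.0f} GB")

  # fp16: ~251 GB -> multi-GPU territory
  # int8: ~125 GB -> the 128GB-RAM PC route (slow on CPU)
  # int4: ~63 GB  -> fits an 80GB H100, a 96GB Mac Studio, or 4x 3090s

So the "fits on an H100" claim pretty much implies a ~4-bit quant, which is also what makes the 96GB Mac Studio and the 4x3090 build (96GB VRAM total) viable.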