More

ThouYS · 2026-05-23T17:42:58 1779558178

I feel like there is no portable advice for performance. A torch model exported as onnx is a different model.

That onnx model run using onnxruntime with cuda ep is a different model than the one run with TRT ep.

And even among the same runtime, depending on the target hardware and the memory available during tuning, the model behaves differently. It is a humongous mess

mycall · 2026-05-24T00:58:58 1779584338

That's interesting as I was considering GGUF --> ONNX conversions (via Olive), but if this creates unknown distortions in the effectiveness and stability, it might be a dead-end idea.

ThouYS · 2026-05-25T13:04:36 1779714276

Just to clarify: I mean VRAM, RAM and runtime performance, not the numerical outputs (even though those also vary to some degree, haha)

ThouYS · 2026-05-05T21:48:48 1778017728

don't know about this guy, but qwen3.6:27b with the UD 4bit quant and little-coder/pi has been amazing. the first local LLM experience that can do actual meaningful work

brcmthrowaway · 2026-05-05T23:49:14 1778024954

What is UD?

ac29 · 2026-05-05T23:58:49 1778025529

Unsloth Dynamic, just some branding from Unsloth for their quants (other people use similar techniques)

ThouYS · 2026-03-19T08:01:30 1773907290

Maybe most interesting about the piece is, that we'll likely see more large scale interviews like this (even if this one is a bit bland)

ThouYS · 2026-03-13T08:53:59 1773392039

what!

ThouYS · 2026-03-13T07:33:35 1773387215

very nice. has a claude touch to it

ThouYS · 2026-02-27T09:35:46 1772184946

this is.. a nothing burger? they don't exclude working for autonomous weapons, nor do they exclude mass surveillance. so what gives?

ThouYS · 2026-02-14T19:08:53 1771096133

the headlines these days

ThouYS · 2026-02-12T16:07:35 1770912455

for me gpt-oss:20b was that. glm 4.7 flash was not better, but much slower on a 16GB card

ThouYS · 2026-02-03T12:39:27 1770122367

wow, everything is exactly unfolding as some AI doomers have projected

ThouYS · 2026-02-03T08:05:32 1770105932

Anki is and was truly a blessing. Not sure I would have gotten through my studies without it. Thank you dae!