Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Cool thing!

A couple suggestions:

1. I have an M3 Ultra with 256GB of memory, but the options list only goes up to 192GB. The M3 Ultra supports up to 512GB. 2. It'd be great if I could flip this around and choose a model, and then see the performance for all the different processors. Would help making buying decisions!

 help



Unfortunately, Apple retired the 512GiB models.

Sure, but those already sold still exist.

>. I have an M3 Ultra with 256GB of memory,

Im sorry but spending this kind of money when you could have just built yourself a dual 3090 workstation that would have been better for pretty much everything including local models is just plain stupid.

Hell, even one 3090 can now run Gemma 3 27b qat very fast.


Are you aware that your 3090s have nowhere close to 256GB of VRAM? Or maybe you are not aware that on macs you have unified memory (working both as RAM and VRAM).

Are you aware that having ram doesn't matter when your tokens/second is slow as shit?

You don't need to run large models, Gemma QAT 27B fits on one GPU and is quite good. Other models like Qwen3 are great for coding.

3090 gets 100+ tokens/second for QWEN, very close to what you would see with a cloud based model.

M3 ultra gets ~30.

Congrats, you played yourself.


> a dual 3090 workstation that would have been better for pretty much everything

Doesn't run macOS


Except if you are living in a region where electricity is quite expensive :/

ask apple to graciously allow you to install your own ram in the computer you "own"



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: