I think AMD is going down a different path, ie. ROCm then partnering with ML fra...

lbhdc · on May 25, 2023

He mentioned ROCm, and apparently had lack luster experience with it.

>The software is called ROCm, it’s open source, and supposedly it works with PyTorch. Though I’ve tried 3 times in the last couple years to build it, and every time it didn’t build out of the box, I struggled to fix it, got it built, and it either segfaulted or returned the wrong answer. In comparison, I have probably built CUDA PyTorch 10 times and never had a single issue.

tzhenghao · on May 25, 2023

Not surprising lol. This was also the experience I had while experimenting with MLIR approximately 3 years ago. You'd need to git checkout a very specific commit and then even change some flags in code to have a successful build. I'm sure things are better now but I haven't messed with it since then.

radq · on May 25, 2023

> I'm sure things are better now but I haven't messed with it since then.

I had the same experience ~3 months ago. Gave up and switched to Nvidia 3090s for my workloads.

ipsum2 · on May 25, 2023

It's because ROCm is not developed for RDNA (consumer) cards, but CDNA (datacenter) cards. No surprise that he's having trouble with it.

PeterisP · on May 25, 2023

AMD is not going down the path of ROCm; perhaps they claim to do so, but as evidenced by the lack of both effort and results, they clearly are not.

The parent post is surprised that they still aren't making the appropriate investments to make it work. They kind of started to do that a few years ago, but then it fell on the wayside without reaching even table stakes, which in my opinion would require providing a ROCm distribution that works out of the box for most of their recent consumer cards (i.e. those cards which the enthusiasts/students/advocates/researchers might use while choosing which software stack to learn, and afterward base corporate compute cluster purchasing decisions on whether they support the software they wrote for e.g. CUDA+Pytorch), and they seem to be failing at that.

meragrin_ · on May 25, 2023

Now only if they would support their hardware and Windows.