Huawei enters the GPU market with a 96 GB VRAM GPU under 2,000 USD, while NVIDIA sells from 10,000+ USD (RTX 6000 PRO)

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 3 months ago

geneva_convenience@lemmy.ml · 3 months ago

For inference only. NVIDIA GPUs are so big because they can train models, not just run them. All other GPUs seem to lack that capability.

nutbutter@discuss.tchncs.de · 3 months ago

You can train or fine-tune a model on any GPU. Sure, it will be slower, but more VRAM is better.
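
To put rough numbers on the VRAM point, here is a back-of-the-envelope sketch. The bytes-per-parameter figures are assumptions, not measurements: 2 bytes for fp16 inference weights, and about 16 bytes for mixed-precision training with Adam (fp16 weights and gradients, plus an fp32 master copy and two fp32 optimizer moments). It ignores activations, KV cache, and framework overhead, and treats "96 GB" as 96 GiB, so real limits will be lower.

```python
# Rough estimate of how many parameters fit in a given amount of VRAM.
# All per-parameter byte counts below are illustrative assumptions.
GIB = 1024 ** 3

BYTES_PER_PARAM = {
    "fp16 inference (weights only)": 2,
    # fp16 weights (2) + fp16 grads (2) + fp32 master weights (4)
    # + fp32 Adam first and second moments (4 + 4)
    "mixed-precision training with Adam": 16,
}


def max_params(vram_gib: float, bytes_per_param: int) -> float:
    """Largest parameter count whose persistent state fits in vram_gib."""
    return vram_gib * GIB / bytes_per_param


if __name__ == "__main__":
    vram_gib = 96  # assumed card size, taken from the thread title
    for workload, bpp in BYTES_PER_PARAM.items():
        billions = max_params(vram_gib, bpp) / 1e9
        print(f"{workload}: about {billions:.1f}B parameters")
```

Under these assumptions the same card holds a model roughly eight times larger for inference than for full training, which is why VRAM matters so much more once you train.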

geneva_convenience@lemmy.ml · 3 months ago

No. The CUDA training stuff is Nvidia only.

herseycokguzelolacak@lemmy.ml · 3 months ago

PyTorch runs on HIP now.
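
A quick way to check this on your own machine, as a minimal sketch: ROCm builds of PyTorch reuse the `torch.cuda` namespace and set `torch.version.hip`, while CUDA builds set `torch.version.cuda` instead. The helper name `describe_backend` is made up for this example, and it says nothing about how well training actually performs.

```python
import importlib.util


def describe_backend() -> str:
    """Report which GPU backend the installed PyTorch build targets."""
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed"

    import torch

    # torch.version.hip is a version string on ROCm builds, None otherwise;
    # torch.version.cuda is a version string on CUDA builds, None otherwise.
    hip = getattr(torch.version, "hip", None)
    cuda = getattr(torch.version, "cuda", None)
    if hip:
        build = f"ROCm/HIP {hip}"
    elif cuda:
        build = f"CUDA {cuda}"
    else:
        build = "CPU-only"

    # On ROCm builds, AMD GPUs are still reported through torch.cuda.
    gpu_visible = torch.cuda.is_available()
    return f"{build} build, GPU visible: {gpu_visible}"


if __name__ == "__main__":
    print(describe_backend())
```

Because the device string stays `"cuda"` on ROCm builds, most training scripts written for NVIDIA cards run without code changes, though individual kernels or third-party extensions may still be missing.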

geneva_convenience@lemmy.ml · edit-2 3 months ago

AMD has been lying about that every year since 2019.

Last time I checked it didn’t. And it probably still doesn’t.

People wouldn’t be buying NVIDIA if AMD worked too. The VRAM prices NVIDIA asks are outrageous.

herseycokguzelolacak@lemmy.ml · 3 months ago

I run llama.cpp and PyTorch on MI300s. It works really well.

geneva_convenience@lemmy.ml · edit-2 3 months ago

Can you train on it too? I tried PyTorch on AMD once and it was awful. They promised mountains but delivered nothing. Newer activation functions were all broken.

llama.cpp is inference only, for which AMD works great too after converting to ONNX. But training was awful on AMD in the past.

herseycokguzelolacak@lemmy.ml · 3 months ago

We have trained transformers and diffusion models on AMD MI300s, yes.

geneva_convenience@lemmy.ml · 3 months ago

Interesting. So why does NVIDIA still hold such a massive monopoly on the datacenter?