Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

Avieshek@lemmy.world · 3 months ago

Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

Eager Eagle@lemmy.world · 3 months ago

I bet he just wants a card to self host models and not give companies his data, but the amount of vram is indeed ridiculous.

Jeena@piefed.jeena.net · 3 months ago

Exactly, I’m in the same situation now and the 8GB in those cheaper cards don’t even let you run a 13B model. I’m trying to research if I can run a 13B one on a 3060 with 12 GB.

The Hobbyist@lemmy.zip · 3 months ago

You can. I’m running a 14B deepseek model on mine. It achieves 28 t/s.

Jeena@piefed.jeena.net · 3 months ago

Oh nice, that’s faster than I imagined.

levzzz@lemmy.world · 3 months ago

You need a pretty large context window to fit all the reasoning, ollama forces 2048 by default and more uses more memory

manicdave@feddit.uk · 3 months ago

I’m running deepseek-r1:14b on a 12GB rx6700. It just about fits in memory and is pretty fast.