PyTorch 2.5.1
Python 3.12
CUDA 12.4
Ollama 0.5.7
Open WebUI 0.5.10 (Python 3.11)
deepseek-r1:14b, deepseek-r1:32b, and deepseek-r1:70b.
RTX 3080 x2 (20GB)
NAME             ID            SIZE   PROCESSOR        UNTIL
deepseek-r1:14b  ea35dfe18182  11 GB  100% GPU         4 minutes from now
deepseek-r1:32b  38056bbcbb2d  21 GB  4%/96% CPU/GPU   4 minutes from now
deepseek-r1:70b  0c1615a8ca32  46 GB  56%/44% CPU/GPU  4 minutes from now
RTX 4090D (24GB)
NAME             ID            SIZE   PROCESSOR        UNTIL
deepseek-r1:32b  38056bbcbb2d  23 GB  100% GPU         4 minutes from now
deepseek-r1:70b  0c1615a8ca32  45 GB  46%/54% CPU/GPU  4 minutes from now
RTX 4090D (24GB), dual GPUs
NAME             ID            SIZE   PROCESSOR  UNTIL
deepseek-r1:70b  0c1615a8ca32  49 GB  100% GPU   4 minutes from now
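The PROCESSOR column in the listings above is what shows whether a model spills over to the CPU. As a rough sketch (not part of the original setup), the text output of `ollama ps` could be parsed like this to read off the GPU share per model. The helper names `gpu_share` and `parse_ollama_ps` are hypothetical, and the regular expression assumes the column layout shown above, which may differ in other Ollama versions.

```python
import re

# Sample copied from the RTX 3080 x2 listing above (line numbers stripped).
SAMPLE = """\
NAME             ID            SIZE   PROCESSOR        UNTIL
deepseek-r1:14b  ea35dfe18182  11 GB  100% GPU         4 minutes from now
deepseek-r1:32b  38056bbcbb2d  21 GB  4%/96% CPU/GPU   4 minutes from now
deepseek-r1:70b  0c1615a8ca32  46 GB  56%/44% CPU/GPU  4 minutes from now
"""

# Assumed row layout: name, hex id, size in GB, then the PROCESSOR cell.
# The header row does not match because "ID" is not a lowercase hex string.
ROW = re.compile(
    r"^(?P<name>\S+)\s+(?P<id>[0-9a-f]+)\s+(?P<size>[\d.]+)\s*GB\s+"
    r"(?P<proc>\d+%/\d+%\s+CPU/GPU|\d+%\s+(?:GPU|CPU))"
)


def gpu_share(proc: str) -> int:
    """Return the GPU percentage from a PROCESSOR cell."""
    if "/" in proc:
        # Split form such as "4%/96% CPU/GPU": the second number is the GPU share.
        return int(proc.split()[0].split("/")[1].rstrip("%"))
    pct, device = proc.split()
    return int(pct.rstrip("%")) if device == "GPU" else 0


def parse_ollama_ps(text: str) -> list[dict]:
    """Parse `ollama ps` text into a list of {name, size_gb, gpu_pct} rows."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if m:
            rows.append(
                {
                    "name": m["name"],
                    "size_gb": float(m["size"]),
                    "gpu_pct": gpu_share(m["proc"]),
                }
            )
    return rows


if __name__ == "__main__":
    for row in parse_ollama_ps(SAMPLE):
        status = "fully on GPU" if row["gpu_pct"] == 100 else "partly on CPU"
        print(f'{row["name"]}: {row["size_gb"]:.0f} GB, {row["gpu_pct"]}% GPU ({status})')
```

In practice the input text could come from `subprocess.run(["ollama", "ps"], capture_output=True, text=True).stdout` instead of the hard-coded sample.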
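deepseek-r1:14b: 20GB of VRAM recommended. Inference runs 100% on the GPU, and responses feel fairly fast.
deepseek-r1:32b: 24GB of VRAM recommended. Inference runs 100% on the GPU, and responses feel fairly fast.
deepseek-r1:70b: 48GB of VRAM recommended. Inference runs 100% on the GPU, and responses feel only moderately fast.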
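The recommendations above can be turned into a small lookup, sketched here for illustration only. The VRAM figures are the ones listed above. The function name `models_fully_on_gpu` is hypothetical, and the thresholds reflect this particular test setup rather than any official requirement.

```python
# Recommended total VRAM (GB) for 100% GPU inference, taken from the list above.
RECOMMENDED_VRAM_GB = {
    "deepseek-r1:14b": 20,
    "deepseek-r1:32b": 24,
    "deepseek-r1:70b": 48,
}


def models_fully_on_gpu(total_vram_gb: float) -> list[str]:
    """Return the models whose recommended VRAM fits within the given total."""
    return [
        name
        for name, needed_gb in RECOMMENDED_VRAM_GB.items()
        if total_vram_gb >= needed_gb
    ]


if __name__ == "__main__":
    # Totals matching the three hardware setups tested above.
    for vram in (20, 24, 48):
        print(f"{vram} GB VRAM: {models_fully_on_gpu(vram)}")
```

With 24 GB this returns the 14b and 32b models, which matches the single RTX 4090D run where deepseek-r1:32b was served 100% from the GPU.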