Actually, using llama.cpp it’s possible to run the full (although heavily quantized) 671B version of DeepSeek R1: see https://unsloth.ai/blog/deepseekr1-dynamic
The reason it’s possible to run a model with less physical RAM than the size of the model, as long as the mass storage is an SSD, is memory mapping (mmap): only the layers currently in use are loaded into RAM. Of course this slows operations down due to frequent loading of pages from the SSD, but since no writes are involved (the model is read-only), the SSD is not subject to wear and tear.
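The mechanism can be sketched in a few lines of Python (a simplified illustration, not llama.cpp’s actual loading code — the file here is a small stand-in for model weights):

```python
import mmap
import os
import tempfile

# Create a file standing in for model weights (small here for demo purposes).
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"\x00" * (1024 * 1024))  # 1 MiB placeholder
    path = f.name

with open(path, "rb") as f:
    # ACCESS_READ gives a read-only mapping: the OS pages in only the
    # regions actually touched, and never writes pages back to disk --
    # which is why the SSD sees no write wear.
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    chunk = mm[4096:8192]  # touching this range loads only those pages
    print(len(chunk))
    mm.close()

os.remove(path)
```

The mapping makes the whole file addressable as if it were in memory, while physical RAM only ever holds the pages being read; untouched layers stay on disk until needed.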
I’ve run it on a gaming laptop (MSI Katana 15 with 64 GB RAM), and it’s indeed very slow: about 0.22 tokens per second.