DeepSeek recently published an article giving a detailed overview of its V3/R1 inference system and disclosing, for the first time, the system's theoretical cost and profit margin. According to the article, all DeepSeek V3 and R1 services run on H800 GPUs and use the same numerical precision as training in order to preserve service quality. DeepSeek also allocates resources differently by day and by night to maximize hardware utilization.
According to the published figures, assuming a GPU rental cost of USD 2 per GPU per hour, DeepSeek's total cost for one day is USD 87,072. If all tokens were billed at DeepSeek R1 pricing, the theoretical total revenue per day would be USD 562,027, giving a cost-profit margin of 545%. Actual revenue is lower than this figure because V3 is priced lower than R1, only part of the service is paid, and discounts apply at night.
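The arithmetic behind the 545% figure can be checked with a short Python sketch. It uses only the three numbers quoted above. The function name `cost_profit_margin` is illustrative, and the GPU-hour and average GPU count values are derived here from the quoted cost and rental rate rather than taken from the article.

```python
# Figures quoted in the article.
GPU_RENTAL_USD_PER_GPU_HOUR = 2.0
DAILY_COST_USD = 87_072
DAILY_THEORETICAL_REVENUE_USD = 562_027


def cost_profit_margin(revenue: float, cost: float) -> float:
    """Profit divided by cost (not by revenue), as a fraction."""
    return (revenue - cost) / cost


margin = cost_profit_margin(DAILY_THEORETICAL_REVENUE_USD, DAILY_COST_USD)

# Derived values (an inference from the quoted numbers, not stated above):
# GPU-hours billed per day, and the average number of GPUs in use.
gpu_hours_per_day = DAILY_COST_USD / GPU_RENTAL_USD_PER_GPU_HOUR
average_gpus_in_use = gpu_hours_per_day / 24

print(f"cost-profit margin: {margin:.0%}")
print(f"GPU-hours per day:  {gpu_hours_per_day:,.0f}")
print(f"average GPUs used:  {average_gpus_in_use:,.0f}")
```

Note that the margin is computed relative to cost. Relative to revenue, the same numbers would give a margin of roughly 85%.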
DeepSeek's high theoretical profit margin comes from its inference system design, which rests on three technical pillars: large-scale cross-node expert parallelism (EP), computation-communication overlap, and load balancing. DeepSeek also reduces costs at the engineering level: during daytime peak hours all nodes serve inference, while at night, when load is low, idle nodes are reassigned to research and training.
Overall, DeepSeek has achieved efficient operation and a high theoretical profit margin through its technical design and careful resource allocation, demonstrating its strength in AI inference systems.