{"id":46690,"date":"2025-02-10T13:47:06","date_gmt":"2025-02-10T18:47:06","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=46690"},"modified":"2025-02-10T13:47:06","modified_gmt":"2025-02-10T18:47:06","slug":"deepseeks-true-training-cost","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2025\/02\/10\/deepseeks-true-training-cost\/","title":{"rendered":"DeepSeek&rsquo;s True Training Cost"},"content":{"rendered":"<p><a href=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-might-not-be-as-disruptive-as-claimed-firm-reportedly-has-50-000-nvidia-gpus-and-spent-usd1-6-billion-on-buildouts\">Anton Shilov<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-might-not-be-as-disruptive-as-claimed-firm-reportedly-has-50-000-nvidia-gpus-and-spent-usd1-6-billion-on-buildouts\"><p><a href=\"https:\/\/semianalysis.com\/2025\/01\/31\/deepseek-debates\/\">SemiAnalysis<\/a> reports that the company behind DeepSeek incurred $1.6 billion in hardware costs and has a fleet of 50,000 Nvidia Hopper GPUs, a finding that undermines the idea that DeepSeek reinvented AI training and inference with dramatically lower investments than the leaders of the AI industry. <\/p><div><\/div><p>DeepSeek operates an extensive computing infrastructure with approximately 50,000 Hopper GPUs, the report claims. This includes 10,000 H800s and 10,000 H100s, with additional purchases of H20 units, according to SemiAnalysis. These resources are distributed across multiple locations and serve purposes such as AI training, research, and financial modeling. The company&rsquo;s total capital investment in servers is around $1.6 billion, with an estimated $944 million spent on operating costs, according to SemiAnalysis.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-02-10\/google-ai-chief-says-deepseek-s-cost-claims-are-exaggerated\">Yazhou Sun and Tom Mackenzie<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-02-10\/google-ai-chief-says-deepseek-s-cost-claims-are-exaggerated\">\n<p>The notion that China&rsquo;s DeepSeek spent under $6 million to develop its artificial intelligence system is &ldquo;exaggerated and a little bit misleading,&rdquo; according Google DeepMind boss Demis Hassabis.<\/p>\n<p>[&#8230;]<\/p>\n<p>DeepSeek &ldquo;seems to have only reported the cost of the final training round, which is a fraction of the total cost.&rdquo;<\/p>\n<\/blockquote>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/02\/07\/deepseek-privacy-issues\/\">DeepSeek Privacy Issues<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/01\/28\/deepseek\/\">DeepSeek<\/a><\/li>\n<\/ul>","protected":false},"excerpt":{"rendered":"<p>Anton Shilov: SemiAnalysis reports that the company behind DeepSeek incurred $1.6 billion in hardware costs and has a fleet of 50,000 Nvidia Hopper GPUs, a finding that undermines the idea that DeepSeek reinvented AI training and inference with dramatically lower investments than the leaders of the AI industry. DeepSeek operates an extensive computing infrastructure with [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2025-02-10T18:47:09Z","apple_news_api_id":"3af96dca-de2b-4a22-835c-a2939d5a0b18","apple_news_api_modified_at":"2025-02-10T18:47:09Z","apple_news_api_revision":"AAAAAAAAAAD\/\/\/\/\/\/\/\/\/\/w==","apple_news_api_share_url":"https:\/\/apple.news\/AOvltyt4rSiKDXKKTnVoLGA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1351,2721,1894,96],"class_list":["post-46690","post","type-post","status-publish","format-standard","hentry","category-technology","tag-artificial-intelligence","tag-deepseek","tag-nvidia","tag-web"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/46690","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=46690"}],"version-history":[{"count":1,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/46690\/revisions"}],"predecessor-version":[{"id":46691,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/46690\/revisions\/46691"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=46690"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=46690"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=46690"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}