{"id":40320,"date":"2023-08-11T14:50:54","date_gmt":"2023-08-11T18:50:54","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=40320"},"modified":"2023-08-22T10:46:59","modified_gmt":"2023-08-22T14:46:59","slug":"cnet-deletes-thousands-of-old-articles-to-game-google-search","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2023\/08\/11\/cnet-deletes-thousands-of-old-articles-to-game-google-search\/","title":{"rendered":"CNET Deletes Thousands of Old Articles to Game Google Search"},"content":{"rendered":"<blockquote cite=\"https:\/\/talk.tidbits.com\/t\/cnet-deletes-thousands-of-old-articles-to-juice-seo\/23453\/4?u=mjtsai\"><p>So speaking as someone who&rsquo;s adjacent to the SEO industry (not my job, but I&rsquo;ve spent a couple of decades in publishing, digital media, and analytics), I can share a little detail about what I suspect is going on here.<\/p><p>&ldquo;Content pruning&rdquo; is a common practice, and largely includes taking out of date content so that readers can focus on more current and\/or profitable content. This is routine for large sites, and usually includes updating out-of-date but popular articles. Also has the benefit of trimming the amount of content to manage - spring cleaning, if you will.<\/p><p>From an SEO perspective, Google will dedicate limited resources to indexing any given site (its so-called &ldquo;crawl budget&rdquo;). If you take down the pages that aren&rsquo;t doing you any good because they&rsquo;re unprofitable, Google stops spending resources on those pages, and stops sending traffic to pages that don&rsquo;t make money. If you&rsquo;re lucky and have better pages with relevant content, Google will hopefully send those people to those better pages instead.<\/p><p>[&#8230;]<\/p><p>As for why Google says this isn&rsquo;t necessary, well, CNET and Google have different objectives.<\/p><\/blockquote><p><a href=\"https:\/\/gizmodo.com\/cnet-deletes-thousands-old-articles-google-search-seo-1850721475\">Thomas Germain<\/a> (via <a href=\"https:\/\/news.slashdot.org\/story\/23\/08\/09\/2037207\/cnet-deletes-thousands-of-old-articles-to-game-google-search\">Slashdot<\/a>, <a href=\"https:\/\/news.ycombinator.com\/item?id=37068464\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/gizmodo.com\/cnet-deletes-thousands-old-articles-google-search-seo-1850721475\"><p>Archived copies of CNET&rsquo;s author pages show the company deleted small batches of articles prior to the second half of July, but then the pace increased. Thousands of articles disappeared in recent weeks. A CNET representative confirmed that the company was culling stories but declined to share exactly how many it has taken down. The move adds to recent controversies over CNET&rsquo;s editorial strategy, which has included layoffs and experiments with error-riddled articles <a href=\"https:\/\/gizmodo.com\/cnet-ai-chatgpt-tech-news-1850017739\">written by AI chatbots<\/a>.<\/p><p>&ldquo;Removing content from our site is not a decision we take lightly. Our teams analyze many data points to determine whether there are pages on CNET that are not currently serving a meaningful audience. This is an industry-wide best practice for large sites like ours that are primarily driven by SEO traffic,&rdquo; said Taylor Canada, CNET&rsquo;s senior director of marketing and communications. &ldquo;In an ideal world, we would leave all of our content on our site in perpetuity. Unfortunately, we are penalized by the modern internet for leaving all previously published content live on our site.&rdquo;<\/p><p>[&#8230;]<\/p><p>Removing, redirecting, or refreshing irrelevant or unhelpful URLs &ldquo;sends a signal to Google that says CNET is fresh, relevant and worthy of being placed higher than our competitors in search results,&rdquo; the document reads.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/twitter.com\/searchliaison\/status\/1689018769782476800\">Danny Sullivan<\/a>:<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/searchliaison\/status\/1689018769782476800\"><p>Are you deleting content from your site because you somehow believe Google doesn&rsquo;t like &ldquo;old&rdquo; content? That&rsquo;s not a thing! Our guidance doesn&rsquo;t encourage this.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/pxlnv.com\/linklog\/cnet-purges-old-articles\/\">Nick Heer<\/a>:<\/p>\n<blockquote cite=\"https:\/\/pxlnv.com\/linklog\/cnet-purges-old-articles\/\">\n<p>A bunch of SEO types Germain interviewed swear by it, but they believe in a lot of really bizarre stuff. It sounds like nonsense to me. After all, Google also prioritizes authority, and a well-known website which has chronicled the history of an industry for decades is pretty damn impressive. Why would &ldquo;a 1996 article about available AOL service tiers&rdquo; &mdash; per the <a href=\"https:\/\/www.documentcloud.org\/documents\/23903730-faq-on-cnet-content-pruning-aug-2023\">internal memo<\/a> &mdash; cause a negative effect on the site&rsquo;s rankings, anyhow? I cannot think of a good reason why a news site purging its archives makes any sense whatsoever.<\/p>\n<\/blockquote>\n\n<p>It&rsquo;s quite possible the consultants were taking them for a ride or are just wrong. But it&rsquo;s also <em>possible<\/em> that the SEO people who follow this stuff really closely for a living have figured out something non-intuitive and unexpected. Google obviously doesn&rsquo;t want to <em>say<\/em> that it incentivizes sites to delete content, and the algorithms are probably not intentionally <em>designed<\/em> to do that, but that doesn&rsquo;t mean this result isn&rsquo;t an <em>emergent property<\/em> of complex algorithms and models that no one fully <a href=\"https:\/\/news.ycombinator.com\/item?id=37071237\">understands<\/a>.<\/p>\n\n<p><a href=\"https:\/\/news.ycombinator.com\/item?id=37079050\">Danny Sullivan<\/a>:<\/p>\n<blockquote cite=\"https:\/\/news.ycombinator.com\/item?id=37079050\"><p>Indexing and ranking are two different things.<\/p><p>Indexing is about gathering content. The internet is big, so we don&rsquo;t index all the pages on it. We try, but there&rsquo;s a lot. If you have a huge site, similarly, we might not get all your pages. Potentially, if you remove some, we might get more to index. Or maybe not, because we also try to index pages as they seem to need to be indexed. If you have an old page that doesn&rsquo;t seem to change much, we probably aren&rsquo;t running back ever hour to it in order to index it again.<\/p><p>[&#8230;]<\/p><p>People who believe removing &ldquo;old&rdquo; content aren&rsquo;t generally thinking that&rsquo;s going to make the &ldquo;new&rdquo; pages get indexed faster. They might think that maybe it means more of their pages overall from a site could get indexed, but that can include &ldquo;old&rdquo; pages they&rsquo;re successful with, too.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/news.ycombinator.com\/item?id=37079446\">fshbbdssbbgdd<\/a>:<\/p>\n<blockquote cite=\"https:\/\/news.ycombinator.com\/item?id=37079446\">\n<p>Suppose CNET published an article about LK99 a week ago, then they published another article an hour ago. If Google hasn&rsquo;t indexed the new article yet, won&rsquo;t CNET rank lower on a search for &ldquo;LK99&rdquo; because the only matching page is a week old?<\/p>\n<p>If by pruning old content, CNET can get its new articles in the results faster, it seems this would get CNET higher rankings and more traffic. Google doesn&rsquo;t need to have a ranking system directly measuring the average age of content on the site for the net effect of Google&rsquo;s systems to produce that effect. &ldquo;Indexing and ranking are two different things&rdquo; is an important implementation detail, but CNET cares about the outcome, which is whether they can show up at the top of the results page.<\/p>\n<\/blockquote>\n\n<p>It would be nice to look at concrete data. Google knows how the CNET pages rank in its index, and CNET knows how its traffic changed (or didn&rsquo;t) after the deletions. But so far neither is sharing.<\/p>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2023\/01\/31\/cnets-ai-powered-seo-money-machine\/\">CNET&rsquo;s AI-powered SEO Money Machine<\/a><\/li>\n<\/ul>\n\n<p id=\"cnet-deletes-thousands-of-old-articles-to-game-google-search-update-2023-08-15\">Update (2023-08-15): <a href=\"https:\/\/pxlnv.com\/linklog\/seo-games\/\">Nick Heer<\/a>:<\/p>\n<blockquote cite=\"https:\/\/pxlnv.com\/linklog\/seo-games\/\">\n<p>The whole entire point of a publisher like CNet is to chronicle an industry. It is too bad its new owners do not see that in either its history or its future.<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/tidbits.com\/2023\/08\/12\/cnet-deletes-thousands-of-old-articles-to-juice-seo\/\">Adam Engst<\/a>:<\/p>\n<blockquote cite=\"https:\/\/tidbits.com\/2023\/08\/12\/cnet-deletes-thousands-of-old-articles-to-juice-seo\/\">\n<p>Though I&rsquo;m dubious of most SEO claims based on my experience with the TidBITS and Take Control sites over decades, it&rsquo;s conceivable that SEO experts have discovered a hack that works&mdash;until Google tweaks its algorithms in response. Regardless, I disapprove of deleting legitimate content because there&rsquo;s no predicting what utility it could provide to the future; at least CNET says it&rsquo;s sending deleted stories to the Internet Archive.<\/p>\n<\/blockquote>\n\n<p id=\"cnet-deletes-thousands-of-old-articles-to-game-google-search-update-2023-08-16\">Update (2023-08-16): <a href=\"https:\/\/any.dev\/@chris\/110867971209370146\">Chris Morrell<\/a>:<\/p>\n<blockquote cite=\"https:\/\/any.dev\/@chris\/110867971209370146\"><p>I will say that Google has a history of publicly stating things about rankings that were measurably untrue. I would not at all be surprised to find out that &ldquo;content pruning&rdquo; is actually effective and is just another way Google&rsquo;s search algos incentivize bad content decisions.<\/p><p>[&#8230;]<\/p><p> Google has claimed for years that they crawl client-side JS just fine, but almost everyone knows that&rsquo;s not true. They&rsquo;ve also said very clearly that Core Web Vitals are important but experimentation shows they have minimal impact.<\/p><p>I&rsquo;m not advocating for deleting content on the web, but I do think that Google has put a lot of publishers in a position to second-guess everything because what they say often doesn&rsquo;t match the evidence.<\/p><\/blockquote>\n\n<p id=\"cnet-deletes-thousands-of-old-articles-to-game-google-search-update-2023-08-22\">Update (2023-08-22): <a href=\"https:\/\/talk.tidbits.com\/t\/cnet-deletes-thousands-of-old-articles-to-juice-seo\/23453\/4?u=mjtsai\">Nik Friedman TeBockhorst<\/a>:<\/p>\n<blockquote cite=\"https:\/\/talk.tidbits.com\/t\/cnet-deletes-thousands-of-old-articles-to-juice-seo\/23453\/4?u=mjtsai\">\n<p>So speaking as someone who&rsquo;s adjacent to the SEO industry (not my job, but I&rsquo;ve spent a couple of decades in publishing, digital media, and analytics), I can share a little detail about what I suspect is going on here.<\/p>\n<p>&ldquo;Content pruning&rdquo; is a common practice, and largely includes taking out of date content so that readers can focus on more current and\/or profitable content. This is routine for large sites, and usually includes updating out-of-date but popular articles. Also has the benefit of trimming the amount of content to manage - spring cleaning, if you will.<\/p>\n<p>From an SEO perspective, Google will dedicate limited resources to indexing any given site (its so-called &ldquo;crawl budget&rdquo;). If you take down the pages that aren&rsquo;t doing you any good because they&rsquo;re unprofitable, Google stops spending resources on those pages, and stops sending traffic to pages that don&rsquo;t make money. If you&rsquo;re lucky and have better pages with relevant content, Google will hopefully send those people to those better pages instead.<\/p>\n<p>[&#8230;]<\/p>\n<p>As for why Google says this isn&rsquo;t necessary, well, CNET and Google have different objectives.<\/p>\n<\/blockquote>","protected":false},"excerpt":{"rendered":"<p>So speaking as someone who&rsquo;s adjacent to the SEO industry (not my job, but I&rsquo;ve spent a couple of decades in publishing, digital media, and analytics), I can share a little detail about what I suspect is going on here.&ldquo;Content pruning&rdquo; is a common practice, and largely includes taking out of date content so that [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2023-08-11T18:50:57Z","apple_news_api_id":"41f6dc18-6530-48cc-9295-54b7afe2f2d2","apple_news_api_modified_at":"2023-08-22T14:47:02Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAg==","apple_news_api_share_url":"https:\/\/apple.news\/AQfbcGGUwSMySlVS3r-Ly0g","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1016,2286,96],"class_list":["post-40320","post","type-post","status-publish","format-standard","hentry","category-technology","tag-datacide","tag-google-search","tag-web"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/40320","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=40320"}],"version-history":[{"count":4,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/40320\/revisions"}],"predecessor-version":[{"id":40406,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/40320\/revisions\/40406"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=40320"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=40320"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=40320"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}