{"id":48321,"date":"2025-07-02T16:17:58","date_gmt":"2025-07-02T20:17:58","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=48321"},"modified":"2025-07-08T15:41:55","modified_gmt":"2025-07-08T19:41:55","slug":"cloudflare-creates-ai-crawler-tollbooth","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2025\/07\/02\/cloudflare-creates-ai-crawler-tollbooth\/","title":{"rendered":"Cloudflare Creates AI Crawler Tollbooth"},"content":{"rendered":"<p><a href=\"https:\/\/blog.cloudflare.com\/content-independence-day-no-ai-crawl-without-compensation\/\">Matthew Prince<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=44443480\">Hacker News<\/a>, <a href=\"https:\/\/tech.slashdot.org\/story\/25\/07\/01\/1745245\/cloudflare-flips-ai-scraping-model-with-pay-per-crawl-system-for-publishers\">Slashdot<\/a>):<\/p>\n<blockquote cite=\"https:\/\/blog.cloudflare.com\/content-independence-day-no-ai-crawl-without-compensation\/\"><p>The problem is whether you create content to sell ads, sell subscriptions, or just to know that people value what you&rsquo;ve created, an AI-driven web doesn&rsquo;t reward content creators the way that the old search-driven web did. And that means the deal that Google made to take content in exchange for sending you traffic just doesn&rsquo;t make sense anymore.<\/p><p>Instead of being a fair trade, the web is being stripmined by AI crawlers with content creators seeing almost no traffic and therefore almost no value.<\/p><p>That changes today, July 1, what we&rsquo;re calling Content Independence Day. Cloudflare, along with a majority of the world&rsquo;s leading publishers and AI companies, is changing the default to block AI crawlers unless they pay creators for their content. That content is the fuel that powers AI engines, and so it&rsquo;s only fair that content creators are compensated directly for it.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/mastodon.social\/@thaddeus\/114779942884207216\">thaddeus<\/a>:<\/p>\n<blockquote cite=\"https:\/\/mastodon.social\/@thaddeus\/114779942884207216\"><p>This is pretty cool, but we&rsquo;re also dangerously close to Cloudflare basically being the whole internet.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/www.theregister.com\/2025\/07\/01\/cloudflare_creates_ai_crawler_toll\/\">Thomas Claburn<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.theregister.com\/2025\/07\/01\/cloudflare_creates_ai_crawler_toll\/\"><p>In a separate <a href=\"https:\/\/blog.cloudflare.com\/ai-search-crawl-refer-ratio-on-radar\/\">post<\/a>, Cloudflare&rsquo;s David Belson, head of data insight, and Sam Rhea, VP of product, published data illustrating the disparity between what AI crawlers take and the referral traffic they send back to websites.<\/p><p>During the period between June 19 and 26, 2025, for example, &ldquo;Anthropic&rsquo;s AI platform Claude made nearly 71,000 HTML page requests for every HTML page referral,&rdquo; observe Belson and Rhea. We must note that these measures only track traffic from the Claude website, not the app, as the app does not emit a <code>Referer:<\/code> header. The same goes for the other AI vendors.<\/p><\/blockquote>\n\n<p><a href=\"https:\/\/www.manton.org\/2025\/07\/01\/cloudflare-is-on-the-offensive.html\">Manton Reece<\/a>:<\/p>\n<blockquote cite=\"https:\/\/www.manton.org\/2025\/07\/01\/cloudflare-is-on-the-offensive.html\">\n<p>I&rsquo;m concerned that this default goes too far. Cloudflare has enormous power to intercept web traffic, because they&rsquo;ve effectively re-centralized DNS for so many websites. While Matthew&rsquo;s reasons for doing this are good, it should still be an opt-in feature. The open web should by default be open.<\/p>\n<p>[&#8230;]<\/p>\n<p>Cloudflare has a series of blog posts today with more details. <a href=\"https:\/\/blog.cloudflare.com\/introducing-pay-per-crawl\/\">In one post<\/a>, they outline how AI crawlers can use HTTP Signatures (similar to what ActivityPub uses) to identify themselves if they have a relationship with Cloudflare for making payments to web publishers. When enabled, Cloudflare will return an HTTP 402 &ldquo;payment required&rdquo; response. There&rsquo;s a mechanism for crawlers to say how much they will pay or to accept the listed price.<\/p>\n<p>[&#8230;]<\/p>\n<p>I can also imagine a harmless bot accidentally getting mislabelled as an AI crawler. Cloudflare has significant control even though they <em>aren&rsquo;t even the ones hosting your web site<\/em>. According to <a href=\"https:\/\/www.cloudflare.com\/press-releases\/2025\/cloudflare-just-changed-how-ai-crawlers-scrape-the-internet-at-large\/\">a companion press release today<\/a>, Cloudflare proxies traffic for 20% of the web.<\/p>\n<\/blockquote>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/05\/29\/macintouch-paused\/\">MacInTouch Paused<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2025\/03\/25\/please-stop-externalizing-your-costs-directly-into-my-face\/\">Please Stop Externalizing Your Costs Directly Into My Face<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2024\/07\/01\/microsofts-suleyman-on-ai-scraping\/\">Microsoft&rsquo;s Suleyman on AI Scraping<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2024\/06\/24\/ai-companies-ignoring-robots-txt\/\">AI Companies Ignoring Robots.txt<\/a><\/li>\n<\/ul>\n\n<p id=\"cloudflare-creates-ai-crawler-tollbooth-update-2025-07-04\">Update (<a href=\"#cloudflare-creates-ai-crawler-tollbooth-update-2025-07-04\">2025-07-04<\/a>): <a href=\"https:\/\/x.com\/vladquant\/status\/1940487203072684487\">Vladimir Prelovac<\/a>:<\/p>\n<blockquote cite=\"https:\/\/x.com\/vladquant\/status\/1940487203072684487\"><p>Cloudflare launched pay per crawl service in an attempt to centralize control of AI crawling economy.<\/p><p>Interestingly there is an open source effort by Coinbase which may be a better way to achieve this for publishers (and this could be the first actually useful thing to come out of the crypto world). This banks on existing http 402 response spec and is conveniently called <a href=\"https:\/\/x402.org\">x402.org<\/a><\/p><p>Now if only it wasn&rsquo;t so darn hard to setup a wallet for your grandma in her browser so we could have decent micropayments on the web. Something I think about a lot in the context of Kagi\/Orion.\n<\/p><\/blockquote>\n\n<p id=\"cloudflare-creates-ai-crawler-tollbooth-update-2025-07-08\">Update (<a href=\"#cloudflare-creates-ai-crawler-tollbooth-update-2025-07-08\">2025-07-08<\/a>): <a href=\"https:\/\/tidbits.com\/2025\/07\/07\/cloudflares-pay-per-crawl-points-to-a-new-model-for-paying-content-creators\/\">Adam Engst<\/a>:<\/p>\n<blockquote cite=\"https:\/\/tidbits.com\/2025\/07\/07\/cloudflares-pay-per-crawl-points-to-a-new-model-for-paying-content-creators\/\"><p>There are undoubtedly numerous concerns with pay-per-crawl, not the least of which is that it would put Cloudflare in a position of even greater power within the Internet ecosystem. It could also hinder academic research and open source projects that lack substantial funding.<\/p><p>However, what I find even more interesting about pay-per-crawl is how it might revive HTTP response code 402 as a more general method of enabling direct transactions between producers and consumers. We&rsquo;re getting close to some of the micropayment-related ideas in Ted Nelson&rsquo;s largely theoretical <a href=\"https:\/\/en.wikipedia.org\/wiki\/Project_Xanadu\">Project Xanadu<\/a>, which could radically democratize commerce on the Internet (I&rsquo;ve been beating this drum for decades; see &ldquo;<a href=\"https:\/\/tidbits.com\/1993\/11\/29\/xanadu-light\/\">Xanadu Light<\/a>,&rdquo; 29 November 1993).<\/p><\/blockquote>","protected":false},"excerpt":{"rendered":"<p>Matthew Prince (Hacker News, Slashdot): The problem is whether you create content to sell ads, sell subscriptions, or just to know that people value what you&rsquo;ve created, an AI-driven web doesn&rsquo;t reward content creators the way that the old search-driven web did. And that means the deal that Google made to take content in exchange [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2025-07-02T20:18:01Z","apple_news_api_id":"fb4b7d80-865a-4ab3-b382-ea1725032220","apple_news_api_modified_at":"2025-07-08T19:41:57Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAAAQ==","apple_news_api_share_url":"https:\/\/apple.news\/A-0t9gIZaSrOzguoXJQMiIA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1351,101,1485,1910,96,2612],"class_list":["post-48321","post","type-post","status-publish","format-standard","hentry","category-technology","tag-artificial-intelligence","tag-business","tag-cloudflare","tag-coinbase","tag-web","tag-web-crawlers"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/48321","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=48321"}],"version-history":[{"count":3,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/48321\/revisions"}],"predecessor-version":[{"id":48392,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/48321\/revisions\/48392"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=48321"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=48321"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=48321"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}