Cloudflare Creates AI Crawler Tollbooth
Matthew Prince (Hacker News, Slashdot):
The problem is whether you create content to sell ads, sell subscriptions, or just to know that people value what you’ve created, an AI-driven web doesn’t reward content creators the way that the old search-driven web did. And that means the deal that Google made to take content in exchange for sending you traffic just doesn’t make sense anymore.
Instead of being a fair trade, the web is being stripmined by AI crawlers with content creators seeing almost no traffic and therefore almost no value.
That changes today, July 1, what we’re calling Content Independence Day. Cloudflare, along with a majority of the world’s leading publishers and AI companies, is changing the default to block AI crawlers unless they pay creators for their content. That content is the fuel that powers AI engines, and so it’s only fair that content creators are compensated directly for it.
This is pretty cool, but we’re also dangerously close to Cloudflare basically being the whole internet.
In a separate post, Cloudflare’s David Belson, head of data insight, and Sam Rhea, VP of product, published data illustrating the disparity between what AI crawlers take and the referral traffic they send back to websites.
During the period between June 19 and 26, 2025, for example, “Anthropic’s AI platform Claude made nearly 71,000 HTML page requests for every HTML page referral,” observe Belson and Rhea. We must note that these measures only track traffic from the Claude website, not the app, as the app does not emit a
Referer:
header. The same goes for the other AI vendors.
I’m concerned that this default goes too far. Cloudflare has enormous power to intercept web traffic, because they’ve effectively re-centralized DNS for so many websites. While Matthew’s reasons for doing this are good, it should still be an opt-in feature. The open web should by default be open.
[…]
Cloudflare has a series of blog posts today with more details. In one post, they outline how AI crawlers can use HTTP Signatures (similar to what ActivityPub uses) to identify themselves if they have a relationship with Cloudflare for making payments to web publishers. When enabled, Cloudflare will return an HTTP 402 “payment required” response. There’s a mechanism for crawlers to say how much they will pay or to accept the listed price.
[…]
I can also imagine a harmless bot accidentally getting mislabelled as an AI crawler. Cloudflare has significant control even though they aren’t even the ones hosting your web site. According to a companion press release today, Cloudflare proxies traffic for 20% of the web.
Previously:
- MacInTouch Paused
- Please Stop Externalizing Your Costs Directly Into My Face
- Microsoft’s Suleyman on AI Scraping
- AI Companies Ignoring Robots.txt
4 Comments RSS · Twitter · Mastodon
Open Web choices are:
- Geoblock most of the world and throttle everything hard.
- Use megacorps to handle problems caused mostly by other megacorps.
Why don’t web hosts or DNS providers get in on this? If it takes off, I’m sure there will be some copycats
Obviously, this isn’t a *real* solution so of course there are tons of problems with it. The real solution is legislation that forces AI companies to honor website’s choices to allow crawling or not, but doesn’t disable search traffic
Cloudflare's outsized control over much of the internet should be a much bigger concern for everybody.
AI crawlers were monetizing content they didn't create, so Cloudflare, who also did not create said content, started monetizing it. and the people cheered