More

robhoeijmakers · 2026-05-03T16:56:48 1777827408

Same ratio roughly. 80% Crawlers and agents, 20% human. Loads of the agents actually serve the content to humans, mostly in ChatGPT.

steve_adams_86 · 2026-05-03T17:16:11 1777828571

Wow, that's a great point... I hadn't considered that. I assumed it's all training.

From what I understand, Cloudflare is trying to create a way for agents to consume content in a more structured manner than allowed for attention to the author, and potentially payment along with it.

I don't want to be paid but I'd love to see how often context from my writing winds up in a session a human is actively using.

robhoeijmakers · 2026-05-03T16:55:09 1777827309

It is good to make a proper distinction, in the ChatGPT context, between crawlers and agents. The crawlers go for the content to build a new model, the agents serve content to users. The last one can be very useful.

tardedmeme · 2026-05-03T17:30:06 1777829406

They use different user-agent strings. The crawlers obfuscate themselves and use residential proxies. The agents call themselves ChatGPT-User. Of course Cloudflare wants OpenAI to pay them for not blocking ChatGPT-User by default.

faangguyindia · 2026-05-03T17:36:52 1777829812

It's true, crawlers used for AI training don't say they are crawlers at all.

robhoeijmakers · 2026-05-03T16:53:03 1777827183

I am on a low tier Ghost subscription, I could not rewrite some of the HTML. So I do this with Cloudflare and then cache it again.

Yes, the architecture setup is generated by ChatGPT but in itself it says what it needs.

robhoeijmakers · 2026-05-03T15:58:38 1777823918

I started blocking some of them. But for now I want to improve visibility before further blocking or optimising. The dashboard helps with this.

robhoeijmakers · 2026-05-03T15:56:34 1777823794

Thanks. It seems to be very local/incidental. The page works from the locations I can test, but I’ll check whether one edge cache or request path served a bad response.

Krasnol · 2026-05-03T17:06:10 1777827970

White page from one of Germany's largest ISPs.

robhoeijmakers · 2026-04-29T09:24:29 1777454669

Built a Cloudflare Worker that classifies all traffic by visitor type: humans, AI crawlers, SEO bots, residential proxies. On most days, humans are a minority. The article explains what the patterns reveal and why edge logs show things your analytics tool can't.

robhoeijmakers · on Feb 1, 2023

A short story on the decades long search for the blue LED, the missing piece in the production of screens and energy efficient white light.

robhoeijmakers · on Jan 22, 2023

Social media platform Twitter reverted to cropping images in the timeline. Not for the web and desktop app, but they do for iPhone.

robhoeijmakers · on Sept 22, 2021

The new Visual Look Up finds words in the tiniest of corners. A story on how I searched for SOUL and found it in a little Amsterdam bookstore.

robhoeijmakers · on Aug 5, 2021

How small bits of text explain and support your message and make your story complete.