
I don't see how an AI crawler is different from any other crawler.

The simplest approach is to count the UA as risky, or to flag multiple 404 errors or HEAD requests, and block on that. Those are rules we already ship out of the box.
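Roughly, such a rule reduces to per-IP counters plus a UA check. A minimal in-memory sketch (hypothetical names and thresholds, not tirreno's actual rule engine):

    from collections import defaultdict

    # Hypothetical "risky UA / too many 404s or HEADs" blocking sketch.
    RISKY_UA_TOKENS = ("GPTBot", "CCBot", "OAI-SearchBot")  # assumed examples
    MAX_404 = 20    # assumed threshold: 404s per IP before blocking
    MAX_HEAD = 50   # assumed threshold: HEAD requests per IP before blocking

    not_found = defaultdict(int)
    head_reqs = defaultdict(int)
    blocked = set()

    def observe(ip, method, status, user_agent):
        """Update counters for one request; return True if the IP is now blocked."""
        if any(token in user_agent for token in RISKY_UA_TOKENS):
            blocked.add(ip)
        if status == 404:
            not_found[ip] += 1
        if method == "HEAD":
            head_reqs[ip] += 1
        if not_found[ip] > MAX_404 or head_reqs[ip] > MAX_HEAD:
            blocked.add(ip)
        return ip in blocked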

It's open source, so there's no difficulty in writing specific rate-limiting rules; hence my question.

Plus, we have developed a dashboard for manually choosing UA blocks based on name, but we're still not sure if this is something that would be really helpful for website operators.



> It's open source, so there's no difficulty in writing specific rate-limiting rules; hence my question.

Depends on the goal.

The author wants his instance not to get killed. Request rate limiting may achieve that easily, in a way that's transparent to normal users.
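For example, a per-IP token bucket only slows down clients that exceed a sustained rate, so normal visitors never notice it (a generic sketch, not tied to any particular server; the rate and burst values are assumptions):

    import time
    from collections import defaultdict

    RATE = 2.0    # assumed sustained requests/second allowed per IP
    BURST = 10.0  # assumed burst capacity

    _buckets = defaultdict(lambda: {"tokens": BURST, "ts": time.monotonic()})

    def allow(ip):
        """Token-bucket check: True if this request is within the per-IP limit."""
        b = _buckets[ip]
        now = time.monotonic()
        b["tokens"] = min(BURST, b["tokens"] + (now - b["ts"]) * RATE)
        b["ts"] = now
        if b["tokens"] >= 1.0:
            b["tokens"] -= 1.0
            return True
        return False  # caller would respond with 429 Too Many Requests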


> count the UA as risky

It's trivial to spoof UAs unfortunately.
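For example, any scraper can present a browser UA with a single header (a generic urllib sketch; the URL is a placeholder):

    import urllib.request

    # A crawler claiming to be an ordinary desktop browser.
    req = urllib.request.Request(
        "https://example.com/",
        headers={"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"},
    )
    with urllib.request.urlopen(req) as resp:
        html = resp.read()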


It depends. If you want to stop OAI-SearchBot/1.3, checking the UA will be enough.
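For a bot that identifies itself honestly, a plain UA check at the edge is enough; it can also be disallowed in robots.txt. A hypothetical WSGI-style sketch:

    BLOCKED_BOT_NAMES = ("OAI-SearchBot",)  # assumed: bot names to refuse

    def block_named_bots(app):
        """Wrap a WSGI app and refuse requests whose UA names a blocked bot."""
        def middleware(environ, start_response):
            ua = environ.get("HTTP_USER_AGENT", "")
            if any(name in ua for name in BLOCKED_BOT_NAMES):
                start_response("403 Forbidden", [("Content-Type", "text/plain")])
                return [b"Forbidden"]
            return app(environ, start_response)
        return middleware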


Why would you need tirreno if you just want to stop OAI's bot though?

OAI's bot is just an example that's easy to explain.

I believe that if something is publicly available, it shouldn't be overprotected in most cases.

However, there are many advanced cases, such as crawlers that collect data for platform impersonation (for scams), custom phishing attacks, or account brute-force attacks. In those cases, I use tirreno to understand traffic across different dimensions.



