death by a million papercuts
January 7, 2026
To follow up on the previous post about our forums and getting scraped by bots, I enabled some statistics in our human-authentication system for the forum. In the last 18 hours, we have seen requests from about 300,000 IPs, of which about 5,000 (approx 1.5%) successfully verified themselves as human.

Of the 295,000 IPs that did not verify themselves as human, about 15,000 were poorly behaved, so much so that they triggered our log-analysis-ban system.
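As a quick check on how those rounded counts fit together, here is a minimal Python sketch. The variable names are illustrative only, and the inputs are the approximate figures quoted above, not exact log data.

```python
# Approximate counts from the post (18-hour window); names are illustrative.
total_ips = 300_000   # distinct IPs seen
verified = 5_000      # verified themselves as human
banned = 15_000       # tripped the log-analysis ban system

unverified = total_ips - verified   # did not verify as human
remaining = unverified - banned     # unverified, but never banned

print(f"unverified: {unverified:,}")
print(f"banned:     {banned:,} ({banned / unverified:.1%} of unverified)")
print(f"remaining:  {remaining:,} ({remaining / total_ips:.1%} of all IPs)")
```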

The remaining 280,000 IPs each behaved relatively well, making occasional requests that more or less mimicked the usage patterns of a real user. They still caused two problems for us, though:
  1. they would often request pages that are expensive to generate (e.g., page 90 of our member list, for which vBulletin has to sort and generate the results, then skip the first 89 pages)
  2. there are 280,000 of them
If we had 280,000 simultaneous legitimate users on our forum, we would have similar problems, though in that situation we could justify spending a lot more on the infrastructure to support our userbase. Moderating such a place would be an impossible nightmare, too.
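The first problem above, deep pages being expensive, can be sketched in a few lines of Python. This is only an illustration of the "sort, then skip" pattern described, not vBulletin's actual code; the function name, the page size of 30, and the member names are all assumptions.

```python
def member_list_page(members, page, per_page=30):
    """Return (rows on the requested page, number of sorted rows skipped).

    Hypothetical helper: per_page=30 is an assumed page size.
    """
    ordered = sorted(members)         # full sort on every request
    skipped = (page - 1) * per_page   # rows produced only to be discarded
    return ordered[skipped:skipped + per_page], skipped


# Made-up member names, purely for illustration.
members = [f"user{i:05d}" for i in range(10_000)]

for page in (1, 90):
    rows, skipped = member_list_page(members, page)
    print(f"page {page}: returned {len(rows)} rows, skipped {skipped}")
```

Under these assumptions, page 90 sorts the whole list and discards 2,670 rows in order to return 30, while page 1 discards none. A human rarely goes that deep; a crawler following every pagination link does it routinely.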

In short, to use a metaphor: if allowed, the modern internet (LLM scraping bots) will effectively kill a forum like ours with a million tiny papercuts.
1 Comment:

Posted by Samster on Sat 24 Jan 2026 at 20:40 from 80.211.143.x

I know you probably won't like my first suggestion, but Cloudflare has been working hard recently on smarter bot-detection strategies to catch many of the human-mimicking bots.

But if you have negative feelings towards Cloudflare, perhaps you can try a self-hostable solution, such as BunkerWeb, with crowd-sourced intelligence. I'm not sure how good it is, but it sounds promising. I have yet to try it myself.

Copyright © 2026 Justin Frankel