This comic follows on from the Previous comic which will almost certainly provide context.

You might not wanna be famous, but when you’re level 10, every organization within a mile is watching what you’re doing.

You are viewing a single thread.
View all comments
87 points
*

Here’s a short little “meanwhile” comic as a bonus, since it’s been a while.

Oh, apologies about my personal website. My hosting is done by a friend with a small server, and… well basically every wordpress site in existence is now under constant effective-DDoS by AI bots trying to scrape all the data. They’re not subtle about it, and just try to download all the pictures simultaneously. My server is too small to handle that load, so just reboots when that happens (it’s usually down for about a minute).

The fact that it’s near constantly down is just a product of how often I’m getting these requests.

permalink
report
reply
19 points

Welcome back. I love these comics. The website situation is super shitty, I wish you luck on that endless battle.

I ended up just switching to your Tumblr page in my webcomic rotation to get around it.

permalink
report
parent
reply
12 points

Yeah, I post to tumblr, extwitter, mastodon, and bluesky. https://linktr.ee/ahdok

permalink
report
parent
reply
46 points
*

This won’t fix it but it might help.

Make sure you have a robots.txt file with a crawl delay set for all agents once every 30 seconds and that you are disallowing most of the WordPress directories such as WP admin, the media directory, etc.

I would also strongly recommend that you use a caching system if you are not using one. It’s a lot more efficient to serve the same image a hundred times to different bots from the ram than loading it off your drive.

Just my personal opinions working in a web hosting environment.

That’ll probably help if it’s i/o issues.

permalink
report
parent
reply
12 points

Yeah, you might need some combination of fail2ban for rude AI and cloudflare caching or something.

permalink
report
parent
reply
15 points

Whoever their host is, they already appear to have some type of load balancing based on the four IPS. But I would also agree that a free cloudflare account does wonders for most WordPress users. But that’s probably mostly because it filters out a shitload of bots and known bad actors. Just make sure you set up your origin certificates if you use a cloudflare account.

permalink
report
parent
reply
35 points

most of these AI scrapers don’t respect robots.txt, so I’m not sure that really helps much, but… we have tried doing all of these things.

permalink
report
parent
reply
0 points
Deleted by creator
permalink
report
parent
reply
22 points

Someone on lemmy suggested to create a dummy endpoint that normal people won’t be able to navigate to, and disallow it in robots.txt

Then when somebody crawls it you know they are ignoring robots.txt, and you ip ban them

permalink
report
parent
reply

RPGMemes

!rpgmemes@ttrpg.network

Create post

Humor, jokes, memes about TTRPGs

Community stats

  • 3.7K

    Monthly active users

  • 885

    Posts

  • 4K

    Comments