Are there any Lemmy servers facing AI scraping invasion?

a Kendrick fan@lemmy.ml · 22 days ago

Are there any Lemmy servers facing AI scraping invasion?

ramble81@lemmy.zip · edit-2 22 days ago

They don’t really need to scrape. They just have to set up their own federated instance and the ActivityPub protocol will willingly hand it all to them in a nicely parsable format.
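
To give a rough idea of what "nicely parsable" means here: federated content travels as ActivityStreams 2.0 JSON, so pulling out the author and text takes only a few lines. This is a minimal sketch, not anything a real instance ships. The payload below is hand-written for illustration, `example.social` is a made-up host, and `extract_note` is a hypothetical helper name.

```python
import json

# Hypothetical payload in ActivityStreams 2.0 shape, written by hand for
# illustration. It was not captured from a real server.
SAMPLE_ACTIVITY = """
{
  "@context": "https://www.w3.org/ns/activitystreams",
  "type": "Create",
  "actor": "https://example.social/u/alice",
  "object": {
    "type": "Note",
    "id": "https://example.social/comment/1",
    "attributedTo": "https://example.social/u/alice",
    "content": "<p>Hello, fediverse</p>",
    "published": "2025-01-01T00:00:00Z"
  }
}
"""


def extract_note(raw: str) -> dict:
    """Pull author, timestamp, and body out of a Create activity (hypothetical helper)."""
    activity = json.loads(raw)
    obj = activity.get("object", {})
    # The "object" field may also be a bare URL string instead of an embedded object.
    if not isinstance(obj, dict):
        return {}
    return {
        "author": obj.get("attributedTo"),
        "published": obj.get("published"),
        "content": obj.get("content"),
    }


record = extract_note(SAMPLE_ACTIVITY)
print(record)
```

No HTML scraping or layout guessing is involved, which is the point: the structure is already there.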

Yardy Sardley@lemmy.ca · 22 days ago

Nepenthes

One link on your website leads to a never-ending labyrinth of nonsense to slowly poison an LLM.
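
For a rough idea of how an endless link maze can work in general, here is a minimal sketch. It is not Nepenthes' actual implementation, and `maze_page` and the `/maze` path are hypothetical names picked for illustration. Each page is derived from a hash of its own path, so the maze never ends and nothing has to be stored.

```python
import hashlib


def maze_page(path: str, links_per_page: int = 5) -> str:
    """Return an HTML page whose links all lead to further generated pages."""
    base = path.rstrip("/")
    links = []
    for i in range(links_per_page):
        # Hash the current path plus an index so the same URL always yields
        # the same page, without keeping any state on the server.
        token = hashlib.sha256(f"{base}/{i}".encode()).hexdigest()[:12]
        links.append(f'<a href="{base}/{token}">{token}</a>')
    return "<html><body>\n" + "\n".join(links) + "\n</body></html>"


page = maze_page("/maze")
print(page)
```

A crawler that follows any of those links just gets another page built the same way, one level deeper.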

potatoguy@potato-guy.space · 22 days ago

I use this nginx extension.

ℕ𝕖𝕞𝕠@slrpnk.net · 22 days ago

slrpnk.net has an AI intercept called Anubis, fwiw

CaptainBasculin@lemmy.ml · 22 days ago

It’s very easy for any ActivityPub content to be scraped; all servers practically serve the content on a silver platter to any federated server.

Otter@lemmy.ca · 22 days ago

We made a post about our actions here:

https://lemmy.ca/post/44214013

Lemuria@lemmy.ml · 20 days ago

I’m sure even the AI devs too lazy to train their AI on anything other than scraped HTML can set up a Lemmy instance and point their crawlers at that.