• 0 Posts
  • 54 Comments
Joined 1 year ago
cake
Cake day: July 8th, 2023

help-circle







  • It’s not really a bot’s opinion though? It’s reporting on salon in general, and letting you know that the reporting has a bias, which means generally, it might promote parts of the story that show Vance in a bad light compared to other reporting, and the. The Ground News link shows that reporting on this topic across several sources tends to be pretty non biased and factual. That’s all good information to have, and saying otherwise means you want to let yourself be misled.

    And everything other than joining the topic and the source is written by humans who are trying to keep people informed.





  • That’s helpful information, they both look like they’ve got good pedigrees. I’m a bit surprised that none of the normal fact and bias checker sites have anything for Drop Site.

    I don’t quite understand the down votes here: Israel is a topic with lots of strong opinions, and basic good media literacy is to do research on sources you don’t recognize. There’s been a lot of shoddy journalism to get clicks out there, and one should be critical of the information they ingest. The Ground News bot is a good thing.






  • I think of this as a problem with opt-in only systems. Think of how sites ask you to opt in to allow tracking cookies every goddamn time a page loads. A rule based system which lets you opt in and opt out, like robots.txt, to let you opt out of cookie requests and tell all sites to fuck would be great. @[email protected] is complaining about malicious instances of crawlers that ignore those rules (assuming they’re right and that the rules are set up correctly), and lumping that malware with software made by established corporations. However, Meta and other big tech companies haven’t historically had a problem with ignoring configurations like robots.txt. They have had an issue with using the data they scrape in ways that are different than what they claimed they would, or scraping data from a site that does not allow scraping by coming at it via a URL on a page that it legitimately scraped, but that’s not the kind of shenanigans this article is about, as meta is being pretty upfront about what they’re doing with the data. At least after they announced it existed.

    An opt-in only solution would just lead to a world where all hosts were being constantly bombarded with requests to opt in. My major take away from how meta handled this is that you should configure any site you own to disallow any action from bots you don’t recognize. As much as reddit can fuck off, I don’t disagree with their move to change their configuration to:

    User-agent: *
    Disallow: /