fosstodon.org is one of the many independent Mastodon servers you can use to participate in the fediverse.
Fosstodon is an invite only Mastodon instance that is open to those who are interested in technology; particularly free & open source software. If you wish to join, contact us for an invite.

Administered by:

Server stats:

8.8K
active users

#Scraping

3 posts3 participants0 posts today
MDZG (Markdown Zen Garden)<p>🔍 / <a href="https://mastodon.uno/tags/software" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>software</span></a> / <a href="https://mastodon.uno/tags/automation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>automation</span></a> / <a href="https://mastodon.uno/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a></p><p>You can build some pretty insane applications using just <a href="https://mastodon.uno/tags/LLMs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLMs</span></a>, even if you don't really know what you're doing. But what separates a good AI app from a great AI app is one thing, and that's data.</p><p>🐱🔗 <a href="https://laravista.altervista.org/CatLink/links/321" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">laravista.altervista.org/CatLi</span><span class="invisible">nk/links/321</span></a></p><p><a href="https://mastodon.uno/tags/catlink" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>catlink</span></a> <a href="https://mastodon.uno/tags/SoftwareAutomation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareAutomation</span></a> <a href="https://mastodon.uno/tags/SoftwareAutomationScraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SoftwareAutomationScraping</span></a> <a href="https://mastodon.uno/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://mastodon.uno/tags/BrightData" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BrightData</span></a> <a href="https://mastodon.uno/tags/AIScraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AIScraping</span></a> <a href="https://mastodon.uno/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a></p>
Alec Muffett<p>[ai-control] prevent robots.txt entries from becoming law | Brewster Kahle of the Internet Archive, weighs in re: legally enforceable statements in robots.txt<br><a href="https://alecmuffett.com/article/113737" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">alecmuffett.com/article/113737</span><span class="invisible"></span></a><br><a href="https://mastodon.social/tags/InternetArchive" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>InternetArchive</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/eula" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>eula</span></a> <a href="https://mastodon.social/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://mastodon.social/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a> <a href="https://mastodon.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a></p>
alecm<p><strong>[ai-control] prevent robots.txt entries from becoming law | Brewster Kahle of the Internet Archive, weighs in re: legally enforceable statements in robots.txt</strong></p><blockquote><p>Having anything in robots.txt reflected in law is such a bad idea, that I suggest we, the Internet standards community, do our best to make it not happen.&nbsp; (I do understand the temptation to think we should make law, but others might disagree 🙂 )&nbsp;&nbsp; So let’s not add any AI features into this standard.</p></blockquote><p><a href="https://mailarchive.ietf.org/arch/msg/ai-control/iy38WylitCEjq76ZogVLcriHeOQ/" rel="nofollow noopener" target="_blank">https://mailarchive.ietf.org/arch/msg/ai-control/iy38WylitCEjq76ZogVLcriHeOQ/</a></p><p><a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/ai" target="_blank">#ai</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/eula" target="_blank">#EULA</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/internet-archive" target="_blank">#internetArchive</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/llm" target="_blank">#llm</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/privacy" target="_blank">#privacy</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/scraping" target="_blank">#scraping</a></p>
Ramin HonaryBookmarking this: <a href="https://billauer.co.il/blog/2025/05/phpbb-attack-bots-ip-addresses/" rel="nofollow noopener" target="_blank">https://billauer.co.il/blog/2025/05/phpbb-attack-bots-ip-addresses/</a><br><br><a class="hashtag" href="https://fe.disroot.org/tag/tech" rel="nofollow noopener" target="_blank">#tech</a> <a class="hashtag" href="https://fe.disroot.org/tag/webadmin" rel="nofollow noopener" target="_blank">#WebAdmin</a> <a class="hashtag" href="https://fe.disroot.org/tag/bots" rel="nofollow noopener" target="_blank">#Bots</a> <a class="hashtag" href="https://fe.disroot.org/tag/scraping" rel="nofollow noopener" target="_blank">#Scraping</a> <a class="hashtag" href="https://fe.disroot.org/tag/scraperbots" rel="nofollow noopener" target="_blank">#ScraperBots</a> <a class="hashtag" href="https://fe.disroot.org/tag/devops" rel="nofollow noopener" target="_blank">#DevOps</a> <a class="hashtag" href="https://fe.disroot.org/tag/security" rel="nofollow noopener" target="_blank">#security</a>
PrivacyDigest<p><a href="https://mas.to/tags/Browser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Browser</span></a> <a href="https://mas.to/tags/Extensions" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Extensions</span></a> Turn Nearly 1 Million <a href="https://mas.to/tags/Browsers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Browsers</span></a> Into Website-Scraping <a href="https://mas.to/tags/Bots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Bots</span></a> - Slashdot <br><a href="https://mas.to/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mas.to/tags/hijack" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hijack</span></a></p><p><a href="https://tech.slashdot.org/story/25/07/09/2257245/browser-extensions-turn-nearly-1-million-browsers-into-website-scraping-bots?utm_source=rss1.0mainlinkanon&amp;utm_medium=feed" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">tech.slashdot.org/story/25/07/</span><span class="invisible">09/2257245/browser-extensions-turn-nearly-1-million-browsers-into-website-scraping-bots?utm_source=rss1.0mainlinkanon&amp;utm_medium=feed</span></a></p>
Frontend Dogma<p>The Open-Source Software Saving the Internet From AI Bot Scrapers, by <span class="h-card" translate="no"><a href="https://mastodon.social/@emanuelmaiberg" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>emanuelmaiberg</span></a></span> (<span class="h-card" translate="no"><a href="https://mastodon.social/@404mediaco" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>404mediaco</span></a></span>):</p><p><a href="https://archive.fo/weURd" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">archive.fo/weURd</span><span class="invisible"></span></a></p><p><a href="https://mas.to/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mas.to/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mas.to/tags/tooling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tooling</span></a></p>
Jonathan Bailey<p>An architecture firm has filed a lawsuit against Pinterest over alleged scraping. However, the case is a real blast from the past.</p><p><a href="https://www.plagiarismtoday.com/2025/07/09/architect-sues-pinterest-over-scraping/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">plagiarismtoday.com/2025/07/09</span><span class="invisible">/architect-sues-pinterest-over-scraping/</span></a></p><p><a href="https://mastodon.world/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://mastodon.world/tags/Pinterest" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pinterest</span></a> <a href="https://mastodon.world/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a></p>
Lawrence B. Almeida<p>2025: Uploading your mind onto the cyberspace?<br>Best I can do is make a half-baked simulacra from some blog posts, a 2014 Twitter bio and 2 potatoes. <br><a href="https://mastodon.social/tags/showerthoughts" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>showerthoughts</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://mastodon.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mastodon.social/tags/tech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tech</span></a> <a href="https://mastodon.social/tags/thoughts" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>thoughts</span></a></p>
DocYeet :verified:<p>Wow ok, done</p><p>That was so easy</p><p>Kudos to this blog post for the amazing tutorial : <a href="https://xeiaso.net/blog/2025/anubis/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">xeiaso.net/blog/2025/anubis/</span><span class="invisible"></span></a></p><p>Managed to also quickly add a grafana dashboard to reflect some metrics, and those numbers give some perspective to the insane spam all the internet is under, just to generate more slop</p><p><a href="https://mastodon.halis.io/tags/selfhosted" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosted</span></a> <a href="https://mastodon.halis.io/tags/homelab" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>homelab</span></a> <a href="https://mastodon.halis.io/tags/kubernetes" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>kubernetes</span></a> <a href="https://mastodon.halis.io/tags/grafana" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grafana</span></a> <a href="https://mastodon.halis.io/tags/prometheus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>prometheus</span></a> <a href="https://mastodon.halis.io/tags/anubis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>anubis</span></a> <a href="https://mastodon.halis.io/tags/gitea" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gitea</span></a> <a href="https://mastodon.halis.io/tags/faang" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>faang</span></a> <a href="https://mastodon.halis.io/tags/spam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>spam</span></a> <a href="https://mastodon.halis.io/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.halis.io/tags/nginx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nginx</span></a> <a href="https://mastodon.halis.io/tags/ingress" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ingress</span></a> <a href="https://mastodon.halis.io/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a></p>
DocYeet :verified:<p>Ok, time to deploy Anubis in front of Gitea, I'm done with those FAANG oligarchs scraping my repos 24/7 to check if anything changed...</p><p>F*ck off.</p><p>But that also means Gitea might get unstable for some time, woops</p><p>If you are curious : <a href="https://git.halis.io" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">git.halis.io</span><span class="invisible"></span></a></p><p>If you see the cute furry, it worked</p><p><a href="https://mastodon.halis.io/tags/homelab" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>homelab</span></a> <a href="https://mastodon.halis.io/tags/selfhosted" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>selfhosted</span></a> <a href="https://mastodon.halis.io/tags/kubernetes" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>kubernetes</span></a> <a href="https://mastodon.halis.io/tags/anubis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>anubis</span></a> <a href="https://mastodon.halis.io/tags/nginx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nginx</span></a> <a href="https://mastodon.halis.io/tags/ingress" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ingress</span></a> <a href="https://mastodon.halis.io/tags/gitea" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gitea</span></a> <a href="https://mastodon.halis.io/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.halis.io/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mastodon.halis.io/tags/faang" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>faang</span></a></p>
Kevin Russell<p><span class="h-card" translate="no"><a href="https://front-end.social/@zeldman" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>zeldman</span></a></span> </p><p>Watt is being Dunn about AI scraping images and descriptions?</p><p>Make RED sure you fill your gravy description meat with AI hostile get em on the beaches words.</p><p>Images uploaded to mastodon should have AI poison added to them.</p><p><a href="https://mstdn.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mstdn.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mstdn.social/tags/ZuckSucks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ZuckSucks</span></a></p>
vicash<p>Really interesting project Anubis to protect against <a href="https://fosstodon.org/tags/LLM" class="mention hashtag" rel="tag">#<span>LLM</span></a> scraping bots : <a href="https://anubis.techaro.lol/" target="_blank" rel="nofollow noopener" translate="no"><span class="invisible">https://</span><span class="">anubis.techaro.lol/</span><span class="invisible"></span></a> <a href="https://fosstodon.org/tags/Scraping" class="mention hashtag" rel="tag">#<span>Scraping</span></a> <a href="https://fosstodon.org/tags/bots" class="mention hashtag" rel="tag">#<span>bots</span></a></p>
Rod2ik 🇪🇺 🇨🇵 🇪🇸 🇺🇦 🇨🇦 🇩🇰 🇬🇱<p>Le <a href="https://mastodon.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mastodon.social/tags/payant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>payant</span></a> : vers un changement radical du modèle économique de l’ <a href="https://mastodon.social/tags/IA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IA</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/g%C3%A9n%C3%A9rative" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>générative</span></a> ?</p><p><a href="https://www.journaldugeek.com/2025/07/04/le-scraping-payant-vers-un-changement-radical-du-modele-economique-de-lia-generative/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">journaldugeek.com/2025/07/04/l</span><span class="invisible">e-scraping-payant-vers-un-changement-radical-du-modele-economique-de-lia-generative/</span></a></p>
Marcel SIneM(S)US<p><a href="https://social.tchncs.de/tags/Cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cloudflare</span></a> lässt KI-Crawler auflaufen, wenn nicht für <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> bezahlt wird | heise online <a href="https://www.heise.de/news/Cloudflare-laesst-KI-Crawler-auflaufen-wenn-nicht-fuer-Scraping-bezahlt-wird-10467015.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">heise.de/news/Cloudflare-laess</span><span class="invisible">t-KI-Crawler-auflaufen-wenn-nicht-fuer-Scraping-bezahlt-wird-10467015.html</span></a> <a href="https://social.tchncs.de/tags/PayPerCrawl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PayPerCrawl</span></a> <a href="https://social.tchncs.de/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://social.tchncs.de/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://social.tchncs.de/tags/Urheberrecht" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Urheberrecht</span></a></p>
Alec Muffett<p>Civil Society: Cloudflare’s latest change {blocks, unblocks} network use by {people, software} that we {hate, love} – {yay, boo} this is {great, terrible}!<br><a href="https://alecmuffett.com/article/113629" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">alecmuffett.com/article/113629</span><span class="invisible"></span></a><br><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/censorship" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>censorship</span></a> <a href="https://mastodon.social/tags/cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudflare</span></a> <a href="https://mastodon.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a></p>
alecm<p><strong>Civil Society: Cloudflare’s latest change {blocks, unblocks} network use by {people, software} that we {hate, love} – {yay, boo} this is {great, terrible}!</strong></p><p>Details don’t matter – pick your own headline. I doubt we have heard the last of this, but this, too, shall pass:</p><blockquote><p>With Cloudflare’s new setting, websites can block – by default – online bots that scrape their data</p></blockquote><p><a href="https://www.nytimes.com/2025/07/01/technology/cloudflare-ai-data.html" rel="nofollow noopener" target="_blank">https://www.nytimes.com/2025/07/01/technology/cloudflare-ai-data.html</a></p> <p>2022: cloudflare blocks kiwifarms, but in 2025 it still exists: </p><p><a href="https://www.theguardian.com/technology/2022/sep/04/cloudflare-reverses-decision-and-drops-trans-trolling-website-kiwi-farms" rel="nofollow noopener" target="_blank">https://www.theguardian.com/technology/2022/sep/04/cloudflare-reverses-decision-and-drops-trans-trolling-website-kiwi-farms</a></p><p>Quote:</p><blockquote><p>In a blog post [in September 2022], which didn’t mention Kiwi Farms or the pressure campaign, Cloudflare’s chief executive, Matthew Prince, and its vice-president of public policy, Alissa Starzak, suggested the company regretted taking action against the far-right websites 8chan and Daily Stormer in 2019 and 2017, saying there was a “deeply troubling” response afterwards from authoritarian regimes calling for the company to block human rights websites.</p></blockquote><p>2017: cloudflare blocks daily stormer, but in 2025 it still exists: </p><p><a href="https://blog.cloudflare.com/why-we-terminated-daily-stormer/" rel="nofollow noopener" target="_blank">https://blog.cloudflare.com/why-we-terminated-daily-stormer/</a></p><p>2016: cloudflare blocks users of the tor project:</p><p><a href="https://blog.torproject.org/trouble-cloudflare/" rel="nofollow noopener" target="_blank">https://blog.torproject.org/trouble-cloudflare/</a></p><p><a href="https://blog.cloudflare.com/the-trouble-with-tor/" rel="nofollow noopener" target="_blank">https://blog.cloudflare.com/the-trouble-with-tor/</a></p><p>2018: cloudflare introduces tor project onion services: </p><p><a href="https://blog.cloudflare.com/cloudflare-onion-service/" rel="nofollow noopener" target="_blank">https://blog.cloudflare.com/cloudflare-onion-service/</a></p><p><strong>Personal Perspective </strong></p><p>Cloudflare’s position is expeditious. Don’t read too much into what either the long term impact will be, nor what the moral impact will pan out as.</p><p><a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/ai" target="_blank">#ai</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/censorship" target="_blank">#censorship</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/cloudflare" target="_blank">#cloudflare</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://alecmuffett.com/article/tag/scraping" target="_blank">#scraping</a></p>
Petra van Cronenburg<p><span class="h-card" translate="no"><a href="https://indieweb.social/@akamran" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>akamran</span></a></span> <span class="h-card" translate="no"><a href="https://me.dm/@davidtoddmccarty" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>davidtoddmccarty</span></a></span> If you search Google for <a href="https://mastodon.online/tags/Mastodon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mastodon</span></a> hashtag scraping, you find software and programs that help AI for doing that. It exists.</p><p>Fact is that from today, the main instances mastodon.social and mastodon.online prohibit <a href="https://mastodon.online/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> officially: <a href="https://techcrunch.com/2025/06/17/mastodon-updates-its-terms-to-prohibit-ai-model-training/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">techcrunch.com/2025/06/17/mast</span><span class="invisible">odon-updates-its-terms-to-prohibit-ai-model-training/</span></a></p><p>Problem of decentralisation: admins/users of other instances must get aware of the problem and change their terms, too.</p><p>It may be funny but it's no joke.</p><p><a href="https://mastodon.online/tags/gravy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gravy</span></a></p>
Tommaso Gagliardoni<p>I keep reading rumours that <a href="https://infosec.exchange/tags/gravy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gravy</span></a> breaks AI crawlers. I am skeptical. Can anyone link a proper source?</p><p><a href="https://infosec.exchange/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://infosec.exchange/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://infosec.exchange/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a></p>
Sozialwelten<p><a href="https://ifwo.eu/tags/Hinweis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hinweis</span></a> auf <a href="https://ifwo.eu/tags/Nutzbarkeit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Nutzbarkeit</span></a> von <a href="https://ifwo.eu/tags/Data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Data</span></a> <a href="https://ifwo.eu/tags/Analytics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Analytics</span></a> / <a href="https://ifwo.eu/tags/Data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Data</span></a> <a href="https://ifwo.eu/tags/Science" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Science</span></a> <a href="https://ifwo.eu/tags/Methode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methode</span></a>​n <a href="https://ifwo.eu/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a>, <a href="https://ifwo.eu/tags/Pattern" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pattern</span></a> <a href="https://ifwo.eu/tags/Recognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Recognition</span></a>, <a href="https://ifwo.eu/tags/Machine" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Machine</span></a> <a href="https://ifwo.eu/tags/Learning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Learning</span></a> oder <a href="https://ifwo.eu/tags/Text" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text</span></a> <a href="https://ifwo.eu/tags/Mining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mining</span></a> für <a href="https://ifwo.eu/tags/soziologisch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>soziologisch</span></a>​e <a href="https://ifwo.eu/tags/Forschung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Forschung</span></a>. </p><p><a href="https://ifwo.eu/tags/Sutter" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sutter</span></a> / <a href="https://ifwo.eu/tags/Maasen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Maasen</span></a> - <a href="https://ifwo.eu/tags/Neuerfindung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Neuerfindung</span></a> <a href="https://ifwo.eu/tags/Soziologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Soziologie</span></a> S.76 f. 2020 DOI: 10.5771/9783845295008-73</p><p><a href="https://ifwo.eu/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://ifwo.eu/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://ifwo.eu/tags/TextMining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextMining</span></a> <a href="https://ifwo.eu/tags/Soziologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Soziologie</span></a> <a href="https://ifwo.eu/tags/BigData" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BigData</span></a> <a href="https://ifwo.eu/tags/Methodologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methodologie</span></a> <a href="https://ifwo.eu/tags/Methodik" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methodik</span></a> <a href="https://ifwo.eu/tags/Sozialforschung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sozialforschung</span></a> <a href="https://ifwo.eu/tags/Sozialwissenschaft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sozialwissenschaft</span></a></p>
Walled Culture<p><strong>Fighting fire with fire: how to tackle the AI bots that threaten the open Web</strong></p><p>It is a measure of how fast the field of AI has developed in the three years since Walled Culture the book (free digital versions available) was published that the issue of using copyright material for training AI systems, briefly mentioned in the book, has become one of the hottest topics in the copyright world, as numerous posts on this blog attest.</p><p>The current situation sees the copyright […]</p><p><a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/ai-bots/" target="_blank">#aiBots</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/cloud-computing/" target="_blank">#cloudComputing</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/cloudflare/" target="_blank">#cloudflare</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/firewalls/" target="_blank">#firewalls</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/free-software/" target="_blank">#freeSoftware</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/genai/" target="_blank">#genai</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/glam-e-lab/" target="_blank">#glamELab</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/open-source/" target="_blank">#openSource</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/open-web/" target="_blank">#openWeb</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/robots-txt/" target="_blank">#robotsTxt</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/scraping/" target="_blank">#scraping</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/survey/" target="_blank">#survey</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/training/" target="_blank">#training</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/unc/" target="_blank">#unc</a> <a rel="nofollow noopener" class="hashtag u-tag u-category" href="https://walledculture.org/tag/wikimedia/" target="_blank">#wikimedia</a></p><p><a href="https://walledculture.org/fighting-fire-with-fire-how-to-tackle-the-ai-bots-that-threaten-the-open-web/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">walledculture.org/fighting-fir</span><span class="invisible">e-with-fire-how-to-tackle-the-ai-bots-that-threaten-the-open-web/</span></a></p>