Mirror of the gdb mailing list
 help / color / mirror / Atom feed
* scraperbot protection - Patchwork and Bunsen behind Anubis
@ 2025-04-21 15:59 Mark Wielaard
  2025-04-22 12:34 ` Guinevere Larsen via Gdb
                   ` (2 more replies)
  0 siblings, 3 replies; 9+ messages in thread
From: Mark Wielaard @ 2025-04-21 15:59 UTC (permalink / raw)
  To: binutils, elfutils-devel, gcc, gdb, libc-alpha, libabigail, newlib
  Cc: overseers

Hi hackers,

TLDR; When using https://patchwork.sourceware.org or Bunsen
https://builder.sourceware.org/testruns/ you might now have to enable
javascript. This should not impact any scripts, just browsers (or bots
pretending to be browsers). If it does cause trouble, please let us
know. If this works out we might also "protect" bugzilla, gitweb,
cgit, and the wikis this way.

We don't like to hav to do this, but as some of you might have noticed
Sourceware has been fighting the new AI scraperbots since start of the
year. We are not alone in this.

https://lwn.net/Articles/1008897/
https://arstechnica.com/ai/2025/03/devs-say-ai-crawlers-dominate-traffic-forcing-blocks-on-entire-countries/

We have tried to isolate services more and block various ip-blocks
that were abusing the servers. But that has helped only so much.
Unfortunately the scraper bots are using lots of ip addresses
(probably by installing "free" VPN services that use normal user
connections as exit point) and pretending to be common
browsers/agents.  We seem to have to make access to some services
depend on solving a javascript challenge.

So we have installed Anubis https://anubis.techaro.lol/ in front of
patchwork and bunsen. This means that if you are using a browser that
identifies as Mozilla or Opera in their User-Agent you will get a
brief page showing the happy anime girl that requires javascript to
solve a challenge and get a cookie to get through. Scripts and search
engines should get through without. Also removing Mozilla and/or Opera
from your User-Agent will get you through without javascript.

We want to thanks Xe Iaso who has helped us set this up and worked
with use over the Easter weekend solving some of our problems/typos.
Please check out if you want to be one of their patrons as thank you.
https://xeiaso.net/notes/2025/anubis-works/
https://xeiaso.net/patrons/

Cheers,

Mark

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2025-04-23 17:54 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-04-21 15:59 scraperbot protection - Patchwork and Bunsen behind Anubis Mark Wielaard
2025-04-22 12:34 ` Guinevere Larsen via Gdb
2025-04-22 13:06   ` Jonathan Wakely via Gdb
2025-04-22 13:17     ` Guinevere Larsen via Gdb
2025-04-22 14:44       ` Jonathan Wakely via Gdb
2025-04-22 21:39     ` Aurelien Jarno via Gdb
2025-04-23  3:52 ` Chris Packham via Gdb
2025-04-23 16:56 ` Christophe Lyon via Gdb
2025-04-23 17:49   ` Frank Ch. Eigler via Gdb

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox