Thanks to a particularly annoying botnet, everyone’s favorite anime cat girl firewall is now helping protect piefed.ca & lemmy.ca from bots and scrapers.

This is requests per second and these are all thousands of scrapers on residential IPs hammering us:

They’d increase their usage until the site started struggling, then move on. I banned their user agents, but have no interest in a cat & mouse game. Anubis should hopefully keep things running much smoother for everyone.

Let me know if you have any trouble!

  • quaff@lemmy.ca
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 days ago

    I’ll be curious to know if y’all experience any federation issues. If not, I may introduce this on the mastodon instances I administrate!

    • Shadow@lemmy.caOPM
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      Nothing so far. Anubis has a built in rule set for activity pub.

  • Shadow@lemmy.caOPM
    link
    fedilink
    English
    arrow-up
    47
    ·
    edit-2
    4 days ago

    • red = obvious bots
    • blue = bots and users hitting the first anubis page (ie, it’s 99.9% bots)
    • green = users.
  • egerlach@lemmy.ca
    link
    fedilink
    English
    arrow-up
    16
    ·
    3 days ago

    F the bots. Would like to be able to have nice things. Happy that at least this is the 🇨🇦-made solution (at least the primary dev, anyways).

    Does Fedecan have the budget to throw a couple of bucks a month to Xe? Completely understand if not, I’ve done not-for-profit corps before and I know what it’s like. But if the budget is there, spending it on a Canadian dev would be a nice choice, IMO.

  • Limonene@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    ·
    3 days ago

    Name and shame. What are the useragent strings? Can the companies be identified?

    It won’t affect me personally, because I already hate all AI companies. But maybe I could convince some people if I tell them what a specific company is doing.

        • F/15/Cali@threads.net@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          4
          ·
          3 days ago

          If you really want to get the info, bludgeoning them legally and cheaply with repeated small claims court processes seems asymmetrical enough to become a slightly cash positive hobby

      • Tollana1234567@lemmy.today
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 days ago

        that make sense, since bots both propaganda ones and the “normal ones” use residential ip on reddit for the same evasion method

    • Shadow@lemmy.caOPM
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 days ago

      They’re all generic user agents that just look like a browser. Nothing fingerprintable

    • Phoenixz@lemmy.ca
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 days ago

      Meh, useragents are easily spoofed and something tells me that most (all) AI companies don’t really care about behing honest there

  • Alfredolin@lemmy.wtf
    link
    fedilink
    English
    arrow-up
    19
    ·
    4 days ago

    Bots can get rekt. I am afraid it’s still gonna be cat & mouse game, let’s see how long anubis works for us (I also use it for my services).

  • connaisseur@feddit.org
    link
    fedilink
    English
    arrow-up
    8
    ·
    4 days ago

    On feddit.org it was also implemented to get rid of bots and reduce load on the infrastructure. There had been some complaints because of the anubis landing page initially, however I think the general acceptance of this measure after explaining is rather high.

    • 9point6@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      4 days ago

      How come you’re looking for an alternative? Does it not do the job for you or something?

      • ikt@aussie.zone
        link
        fedilink
        English
        arrow-up
        9
        ·
        4 days ago

        tbh i would prefer something silent instead of a full screen block page while it figures out whether I’m a bot or not

        I don’t even like cloudflare click to confirm you’re not a bot pages which auto confirm

        • [object Object]@lemmy.ca
          link
          fedilink
          English
          arrow-up
          8
          ·
          edit-2
          3 days ago

          To my knowledge, which is often wrong, that’s necessary.

          It’s a proof of work system, so your browser has to receive the challenge work, create background workers to do it, then submit the results and get authenticated.

          If the work wasn’t challenging (slow), then it wouldn’t be any impediment to scrapers and bots.

          Whether there are alternatives to proof of work that work well, I do not know. But fingerprinting alone is actually very difficult.

        • 9point6@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          3 days ago

          FWIW I think cloudflare and similar do the full screen thing too, they just render a blank page though so it just feels like more load time.

          I don’t run Anubis on my stuff currently, but I’d be surprised if it doesn’t have a similar feature

    • Shadow@lemmy.caOPM
      link
      fedilink
      English
      arrow-up
      4
      ·
      3 days ago

      I didn’t realize thunderbird could access lemmy. Will look later today.

      • flameleaf@ani.social
        link
        fedilink
        English
        arrow-up
        5
        ·
        3 days ago

        Yep. The community pages all provide RSS feeds and Mastodon has them as well.

        It’s so nice using it as an all-in-one newsfeed aggregated with everything else.

        • Grimpen@lemmy.ca
          link
          fedilink
          English
          arrow-up
          2
          ·
          3 days ago

          Now that you say that, I did set up another RSS reader to display a Lemmy RSS. Thunderbird sports RSS, therefore…