I wondered about the robots.txt

I can see the case for it, but I can also see the case for allowing at least Google to index the site.

Has there been some discussion about this previously?

    • Sam_uk@slrpnk.netOP
      1 month ago

      I think it would just be:

      User-agent: *
      Disallow: /
      User-agent: Googlebot
      Allow: /
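
A quick way to sanity-check rules like these is to feed them to Python's standard-library robots.txt parser and ask what a well-behaved crawler would conclude. This is only a sketch: the URL and the `SomeOtherBot` name are placeholders for illustration, and it says nothing about crawlers that ignore robots.txt or about any separate server-side blocking.

```python
from urllib.robotparser import RobotFileParser

# The rules exactly as posted above.
RULES = """\
User-agent: *
Disallow: /
User-agent: Googlebot
Allow: /
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

# Placeholder URL for illustration; any path on the site behaves the same here.
url = "https://slrpnk.net/"

# Googlebot matches its own group, so it is allowed.
googlebot_allowed = parser.can_fetch("Googlebot", url)

# "SomeOtherBot" is a hypothetical crawler name; it falls back to the "*" group.
other_allowed = parser.can_fetch("SomeOtherBot", url)

print("Googlebot allowed:", googlebot_allowed)
print("SomeOtherBot allowed:", other_allowed)
```

If more search engines were allow-listed, each one would need its own `User-agent` group with `Allow: /`, and could be checked the same way.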
      
      • poVoq@slrpnk.netM
        30 days ago

        OK, I tried to allow-list some search engine spiders in the robots.txt. However, they will probably still run into the AI scraper block if they act too shady.

        But honestly, I highly doubt we will get much traffic from Google search. It’s completely gone to shit these days.