GnuLinuxDude@lemmy.ml to lemmy.ml meta@lemmy.ml · 2 years ago

Should lemmy.ml block chatgpt scraping in robots.txt?


Some context about this here: https://arstechnica.com/information-technology/2023/08/openai-details-how-to-keep-chatgpt-from-gobbling-up-website-data/

The robots.txt would be updated with this entry:

User-agent: GPTBot
Disallow: /

Obviously this is meaningless against non-OpenAI scrapers or anyone who just doesn’t give a shit.
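For anyone who wants to sanity-check what that entry would do, here is a rough sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed`, the second user agent `SomeOtherBot`, and the example URL are made up for illustration. It only shows how a crawler that chooses to honor robots.txt would read the rule; nothing here enforces anything.

```python
from urllib.robotparser import RobotFileParser

# The entry proposed in the post, verbatim.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if a well-behaved crawler with this user agent may fetch url.

    Hypothetical helper for illustration; it parses the rules from a string
    instead of fetching them from a live site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "SomeOtherBot" and the URL are assumptions, not taken from the thread.
    for agent in ("GPTBot", "SomeOtherBot"):
        print(agent, is_allowed(agent, "https://lemmy.ml/c/meta"))
```

With only this entry in the file, GPTBot is told to stay out of every path, while any agent not named is still allowed. A scraper that never runs a check like this is unaffected.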

  • dreadedsemi@lemmy.world · 2 years ago

    Robots.txt is more of an honor system. If they respect it, they won’t do that trick.
