• @Breve@pawb.social
    English
    372 days ago

    They’re too late. There’s going to be way too much AI-generated garbage in their data, and many social media platforms like Reddit and Twitter have already taken measures to curb scrapers.

    • @chickenf622@sh.itjust.works
      English
      182 days ago

      Like those platforms aren’t already full of AI garbage as well. Training new models will require a cut-off date before the genie was let out of the bottle.

    • Drunemeton
      English
      32 days ago

      I think that’s the “25-times faster” bit. They seem to be in a hurry to collect as much human-generated data as possible.

      • GHiLA
        English
        42 days ago

        How does it know what is and isn’t?

        Uh oh.

        • JackbyDev
          English
          18 hours ago

          I mean, if I could theoretically take a snapshot of the entire Internet I’d rather do it now than later because there’s just gonna be more AI later.

        • Drunemeton
          English
          52 days ago

          Yeah…

          Hey! Perhaps they’ll use A.I. to weed out the A.I.-generated bits.