Today, a prominent child safety organization, Thorn, in partnership with a leading cloud-based AI solutions provider, Hive, announced the release of an AI model designed to flag unknown CSAM at upload. It’s the earliest AI technology striving to expose unreported CSAM at scale.

  • Railcar8095@lemm.ee
    link
    fedilink
    English
    arrow-up
    3
    ·
    8 days ago

    So you need to have a model that generates CP to begin with. Flawless reasoning there.

    Look, it’s clear you have no clue what you’re talking about. Stop demonstrating it, moron.

    • JackbyDev@programming.dev
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 days ago

      The model I use (I forget the name) popped out something pretty sus once. I wouldn’t describe it as CP, but it was definitely weird enough to really make me uncomfortable. It’s the only thing it ever made that I immediately deleted and removed from the recycling bin too lol.

      The point I’m making is that this isn’t as far fetched as you believe.

      Plus, you can merge models. Get a general purpose model that knows what children look like, a general purpose pornographic model, merge them, then start generating and selecting images based on Thorn’s classifier.

      • Railcar8095@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 days ago

        You can’t merge a generative model and a classification model. You can run then in series to get a bunch of false positives/hallucinations, but you can’t make it generate something from the other model.

        • JackbyDev@programming.dev
          link
          fedilink
          English
          arrow-up
          1
          ·
          8 days ago

          When I said a “general purpose model that knows what children look like” I didn’t mean the classification model from the article. I meant a normal, general purpose image generation model. When I said “that knows what children look like” I mean part of its training set is on children, because it’s sort of trained a little on everything. When I said “pornographic model” I mean a model trained exclusively on NSFW content (and not including any CSAM, but that may be generous depending on how much care was out into the model’s creation).

    • JackbyDev@programming.dev
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 days ago

      Alright, I found the name of what I was thinking of that sounds similar to what they’re suggesting: generative adversarial network (GAN).

      The core idea of a GAN is based on the “indirect” training through the discriminator, another neural network that can tell how “realistic” the input seems, which itself is also being updated dynamically. This means that the generator is not trained to minimize the distance to a specific image, but rather to fool the discriminator. This enables the model to learn in an unsupervised manner.

      • Railcar8095@lemm.ee
        link
        fedilink
        English
        arrow-up
        3
        ·
        8 days ago

        Applying GAN won’t work. If used for filtering would result on results being skewed to a younger, but it won’t show 9 the body of a 9 year old unless the model could do that from the beginning.

        If used to “tune” the original model, it will result on massive hallucination and aberrations that can result in false positives.

        In both cases, decent results will be rare and time consuming. Anybody with the dedication to attempt this already has pictures and can build their own model.

        Source: I’m a data scientist