- cross-posted to:
- technology@lemmit.online
Study shows AI image-generators being trained on explicit photos of children::Hidden inside the foundation of popular artificial intelligence image-generators are thousands of images of child sexual abuse, according to a new report that urges companies to take action to address a harmful flaw in the technology they built.
3,200 images is roughly 0.00005% of the ~5 billion entries in the dataset in question, almost certainly scraped in by accident. The problematic images ought to be removed from the dataset, but this does not “contaminate” models trained on the dataset in any plausible way.
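For scale, a rough back-of-envelope check, assuming the dataset is LAION-5B with about 5.85 billion image-URL pairs (the dataset the report examined):

```python
# Rough scale check (assumes ~5.85 billion entries, the approximate size of LAION-5B).
flagged = 3_200
total = 5_850_000_000
print(f"{flagged / total:.7%}")  # prints 0.0000547% -- a vanishingly small fraction
```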
It’s not even 3,200 images used; it’s 3,200 hashed URLs found in the dataset. Most of the images have likely since been removed and the URLs are dead, so no model was trained on them.
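A minimal sketch of what removing flagged entries could look like, assuming the metadata ships as parquet with a `url` column and an `md5` content-hash column (column and file names here are hypothetical) and that a blocklist of known-bad hashes is available:

```python
# Sketch: drop dataset rows whose content hash appears on a blocklist.
# Assumes parquet metadata with a "md5" column (hypothetical name) and a
# newline-separated blocklist file of hex digests.
import pandas as pd

def filter_metadata(parquet_path: str, blocklist_path: str, out_path: str) -> int:
    """Write a cleaned copy of the metadata; return the number of rows removed."""
    with open(blocklist_path) as f:
        bad_hashes = {line.strip().lower() for line in f if line.strip()}
    df = pd.read_parquet(parquet_path)
    keep = ~df["md5"].str.lower().isin(bad_hashes)
    df[keep].to_parquet(out_path)
    return int((~keep).sum())

removed = filter_metadata("laion_shard.parquet", "bad_md5s.txt", "laion_shard_clean.parquet")
print(f"removed {removed} flagged rows")
```

Since the dataset stores URLs rather than images, filtering the metadata like this is all that is needed to keep flagged material out of future training runs.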