On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don’t require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

  • @simple@lemm.ee
    7 months ago

    We’re going to need strong digital signatures on everything

    That won’t help anything considering how easy it is to strip metadata.

    • @cygnus@lemmy.ca
      7 months ago

      I mean the opposite scenario, where if there’s no signature we assume it’s fake.

      • @catloaf@lemm.ee
        7 months ago

        We’ve had email forgery, and signatures to prevent it, for decades, but barely anyone signs their email either.