YouTube videos of 6K celebrities helped train AI model to animate photos in real time.
On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don't require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.
We're going to need strong digital signatures on everything, and we need them fast, or else we won't be able to believe anything we see. It will be Steve Bannon's "flood the zone with shit" dream come true.