Transforming Video Content Into Another Video’s Style, Automatically
Researchers at Carnegie Mellon University have devised a way to automatically transform the content of one video into the style of another, making it possible to transfer the facial expressions of comedian John Oliver to those of a cartoon character, or to make a daffodil bloom in much the same way a hibiscus would.
Because the data-driven method does not require human intervention, it can rapidly transform large amounts of video, making it a boon to movie production. It can also be used to convert black-and-white films to color and to create content for virtual reality experiences.
“I think there are a lot of stories to be told,” said Aayush Bansal, a Ph.D. student in CMU’s Robotics Institute. Film production was his primary motivation in helping devise the method, he explained, enabling movies to be produced more quickly and cheaply. “It’s a tool for the artist that gives them an initial model that they can then improve,” he added.
The technology also has the potential to be used for so-called “deep fakes,” videos in which a person’s image is inserted without permission, making it appear that the person has done or said things that are out of character, Bansal acknowledged.
“It was an eye opener to all of us in the field that such fakes would be created and have such an impact,” he said. “Finding ways to detect them will be important moving forward.”
Transferring content from one video to the style of another relies on artificial intelligence. In particular, a class of algorithms called generative adversarial networks (GANs) have made it easier for computers to understand how to apply the style of one image to another, particularly when they have not been carefully matched.
In a GAN, two models are created: a discriminator that learns to detect what is consistent with the style of one image or video, and a generator that learns how to create images or videos that match a certain style. When the two work competitively — the generator trying to trick the discriminator and the discriminator scoring the effectiveness of the generator — the system eventually learns how content can be transformed into a certain style.
A variant, called cycle-GAN, completes the loop, much like translating English speech into Spanish and then the Spanish back into English and then evaluating whether the twice-translated speech still makes sense. Using cycle-GAN to analyze the spatial characteristics of images has proven effective in transforming one image into the style of another.
That spatial method still leaves something to be desired for video, with unwanted artifacts and imperfections cropping up in the full cycle of translations. To mitigate the problem, the researchers developed a technique, called Recycle-GAN, that incorporates not only spatial, but temporal information. This additional information, accounting for changes over time, further constrains the process and produces better results.
The researchers showed that Recycle-GAN can be used to transform video of Oliver into what appears to be fellow comedian Stephen Colbert and back into Oliver. Or video of John Oliver’s face can be transformed into a cartoon character. Recycle-GAN allows not only facial expressions to be copied, but also the movements and cadence of the performance.
The effects aren’t limited to faces, or even bodies. The researchers demonstrated that video of a blooming flower can be used to manipulate the image of other types of flowers. Or clouds that are crossing the sky rapidly on a windy day can be slowed to give the appearance of calmer weather.
Such effects might be useful in developing self-driving cars that can navigate at night or in bad weather, Bansal said. Obtaining video of night scenes or stormy weather in which objects can be identified and labeled can be difficult, he explained. Recycle-GAN, on the other hand, can transform easily obtained and labeled daytime scenes into nighttime or stormy scenes, providing images that can be used to train cars to operate in those conditions.
Learn more: Beyond Deep Fakes
The Latest on: Deep fakes
via Google News
The Latest on: Deep fakes
- Trump doesn't take Russian electoral interference seriously. This is what Democrats did to oppose it in 2018. on December 18, 2018 at 11:17 am
We think future pledges like this one should include promises not to hack, use hacked materials or use fake accounts, bots, troll farms or “deep fakes.” Whether the parties themselves can agree to a c... […]
- AI can now create life-like human faces from scratch on December 18, 2018 at 9:35 am
This technology, called Generative Adversarial Networks (GANs), is behind the wave of digitally manipulated videos and photos called "deep fakes". The results are that famous people such as actors or ... […]
- Fake news vs fact in online battle for truth on December 15, 2018 at 2:28 am
A relatively new development is deep fakes -- manipulated videos that appear genuine but depict events or speech that never happened. For now, deep fakes are technically difficult to create and have n... […]
- Text fight tops FCC meeting on December 12, 2018 at 7:00 am
The fake news frontier: Foreign Affairs looks at “the coming age of post-truth geopolitics,” including the rise of deep fakes and new disinformation tactics. […]
- The Cybersecurity 202: Trump is getting tough on Chinese hacking. Will it work? on December 12, 2018 at 4:28 am
Moreover, operatives may use doctored videos known as deep fakes to throw the next presidential campaign into disarray, Mook said at a WSJ Pro Cybersecurity Executive Forum. Judd Choate, Colorado’s el... […]
- 'Start Here': Brexit vote postponed, Pelosi and Schumer to meet with Trump, deceptive 'deepfakes.' What you need to know to start your day. on December 11, 2018 at 2:03 am
British Prime Minister Theresa May delayed Tuesday's key vote in Parliament on Brexit after she admitted the deal "would be rejected by a significant margin." May told members of Parliament she ... […]
- Seeing but not believing: Inside the business of “deepfakes” on December 10, 2018 at 6:36 pm
In a video seen by millions, a man that looks and sounds just like President Obama gives an address. But instead of a polished speech, he spouts out controversial opinions -- and even a curse word. […]
- NVIDIA's new AI turns videos of the real world into virtual landscapes on December 3, 2018 at 5:18 am
Just look at deepfakes: it's getting remarkably difficult to tell these artificially generated videos apart from the real thing, and as NVIDIA proved with its Gangnam Style test, its neural model coul... […]
via Bing News