A new algorithm allows video editors to modify talking head videos as if they were editing text – copying, pasting, or adding and deleting words.
In television and film, actors often flub small bits of otherwise flawless performances. Other times they leave out a critical word. For editors, the only solution so far is to accept the flaws or fix them with expensive reshoots.
Imagine, however, if that editor could modify video using a text transcript. Much like word processing, the editor could easily add new words, delete unwanted ones or completely rearrange the pieces by dragging and dropping them as needed to assemble a finished video that looks almost flawless to the untrained eye.
A team of researchers from Stanford University, Max Planck Institute for Informatics, Princeton University and Adobe Research created such an algorithm for editing talking-head videos – videos showing speakers from the shoulders up.
The work could be a boon for video editors and producers but does raise concerns as people increasingly question the validity of images and videos online, the authors said. However, they propose some guidelines for using these tools that would alert viewers and performers that the video has been manipulated.
“Unfortunately, technologies like this will always attract bad actors,” said Ohad Fried, a postdoctoral scholar at Stanford. “But the struggle is worth it given the many creative video editing and content creation applications this enables.”
The application uses the new transcript to extract speech motions from various video pieces and, using machine learning, convert those into a final video that appears natural to the viewer – lip-synched and all.
“Visually, it’s seamless. There’s no need to rerecord anything,” said Fried, who is first author of a paper about the research published on the pre-publication website arXiv. It will also be in the journal ACM Transactions on Graphics. Fried works in the lab of Maneesh Agrawala, the Forest Baskett Professor in the School of Engineering and senior author of the paper. The project began when Fried was a graduate student working with computer scientist Adam Finkelstein at Princeton, more than two years ago.
Should an actor or performer flub a word or misspeak, the editor can simply edit the transcript and the application will assemble the right word from various words or portions of words spoken elsewhere in the video. It’s the equivalent of rewriting with video, much like a writer retypes a misspelled or unfit word. The algorithm does require at least 40 minutes of original video as input, however, so it won’t yet work with just any video sequence.
As the transcript is edited, the algorithm selects segments from elsewhere in the recorded video with motion that can be stitched to produce the new material. In their raw form these video segments would have jarring jump cuts and other visual flaws.
To make the video appear more natural, the algorithm applies intelligent smoothing to the motion parameters and renders a 3D animated version of the desired result. However, that rendered face is still far from realistic. As a final step, a machine learning technique called Neural Rendering converts the low-fidelity digital model into a photorealistic video in perfect lip-synch.
To test the capabilities of their system the researchers performed a series of complex edits including adding, removing and changing words, as well as translations to different languages, and even created full sentences as if from whole cloth.
In a crowd-sourced study with 138 participants, the team’s edits were rated as “real” almost 60 percent of the time. The visual quality is such that it is very close to the original, but Fried said there’s plenty of room for improvement.
“The implications for movie post-production are big,” said Ayush Tewari, a student at the Max Planck Institute for Informatics and the paper’s second author. It presents for the first time the possibility of fixing filmed dialogue without reshoots.
Nonetheless, in an era of synthesized fake videos such capabilities raise important ethical concerns, Fried added. There are very valuable and justifiable reasons to want to edit video in this way, namely the expense and effort required to rerecord or repair such flaws in video content, or to customize existing audio-visual video content by audience. Instructional videos might be fine-tuned to different languages or cultural backgrounds, for instance, or children’s stories could be adapted to different ages.
“This technology is really about better storytelling,” Fried said.
Fried acknowledges concerns that such a technology might be used for illicit purposes, but says the risk is worth taking. Photo-editing software went through a similar reckoning, but in the end, people want to live in a world where photo-editing software is available.
As a remedy, Fried says there are several options. One is to develop some sort of opt-in watermarking that would identify any content that had been edited and provide a full ledger of the edits. Moreover, researchers could develop better forensics such as digital or non-digital fingerprinting techniques to determine whether a video had been manipulated for ulterior purposes. In fact, this research and others like it also build the essential insights that are needed to develop better manipulation detection.
None of the solutions can fix everything, so viewers must remain skeptical and cautious, Fried said. Besides, he added, there are already many other ways to manipulate video that are much easier to execute. He said that perhaps the most pressing matter is to raise public awareness and education on video manipulation, so people are better equipped to question and assess the veracity of synthetic content.
The Latest on: Synthetic content
via Google News
The Latest on: Synthetic content
- Why I'd invest in a robot teenageron July 4, 2019 at 8:41 am
As a clever colleague from betaworks--Jared--points out, many VCs are allergic to content studios and hence startups pitch ... as well as the introduction of synthetic media regulation. On the latter ... […]
- Global Bone Glue Market Is Expected To Reach US$ 1,863.9 Mn by 2025 | Credence Researchon July 2, 2019 at 11:22 pm
Synthetic bone glues are being loaded with antibiotics in ... https://www.credenceresearch.com/sample-request/58925 Table of Content ... […]
- Drone simulators with MetaVR image generators ordered by USAFon July 2, 2019 at 11:21 am
MetaVR, 3D content and software developer ... Within the MJAT, VRSG is part of the Air Force Synthetic Environment for Reconnaissance and Surveillance (AFSERS) software to stimulate the tactical ... […]
- Santander opts for dual tranche structure on UK SME syntheticon July 2, 2019 at 7:41 am
Santander has closed York 2019-1, a synthetic securitization of loans granted to SME ... Please take a trial or subscribe to access this content. Contact our subscriptions team to discuss your access: ... […]
- Santander closes synthetic CLOon July 1, 2019 at 8:37 am
The weighted average life of the 3,466 underlying loans is 3.1 years. Please take a trial or subscribe to access this content. Contact our subscriptions team to discuss your access: [email protected] ... […]
- Synthetic Musk Market Size and Key Trends in Terms of Volume and Value 2018 - 2028on July 1, 2019 at 8:30 am
Synthetic musk has been a common content in perfumes; most of the perfumes contain about 10–20% of synthetic musk. It is used as the base fragrance in many perfumes. However, there are some challenges ... […]
- Synthetic Biologics Regains Compliance with NYSE American Continued Listing Standardson July 1, 2019 at 4:00 am
Synthetic Biologics, Inc. (NYSE:SYN) is a clinical-stage company developing therapeutics that preserve the microbiome to protect and restore the health of patients. The Company's lead candidates are: ... […]
- Synthetic Reality & Deep Fakes: Considerations For Employers And Implications Of The Rise Of Deep Fakes In The Workplaceon June 28, 2019 at 1:55 am
With the recent advent of generative deep-learning models, like variational autoencoders and generative adversarial networks (GANs), in which one neural network generates content (e.g., realistic ... […]
- Microfiber Synthetic Leather Market Expected to Secure Notable Revenue Share During 2018-2028on June 24, 2019 at 7:59 am
Table of Content Covered in the report are: 1. Global Microfiber Synthetic Leather Market Outlook 2. Global Microfiber Synthetic Leather Market - Executive Summary 3. Global Microfiber Synthetic ... […]
- Synthetic Genomics’ “Breakthrough” Algae Produces Twice as Much Oilon June 18, 2019 at 5:00 pm
See below.] The San Diego biotech company Synthetic Genomics and supermajor ExxonMobil (NYSE: XOM) said today they have successfully inserted a gene in a particular strain of algae that more than ... […]
via Bing News