An algorithm-based system that identifies telltale linguistic cues in fake news stories could provide news aggregator and social media sites like Google News with a new weapon in the fight against misinformation.
The University of Michigan researchers who developed the system have demonstrated that it’s comparable to and sometimes better than humans at correctly identifying fake news stories.
In a recent study, it successfully found fakes up to 76 percent of the time, compared to a human success rate of 70 percent. In addition, the researchers’ linguistic analysis approach could be used to identify fake news articles that are too new to be debunked by cross-referencing their facts with other stories.
Rada Mihalcea, the U-M computer science and engineering professor behind the project, said an automated solution could be an important tool for sites that are struggling to deal with an onslaught of fake news stories, often created to generate clicks or to manipulate public opinion.
Catching fake stories before they have real consequences can be difficult, as aggregator and social media sites today rely heavily on human editors who often can’t keep up with the influx of news. In addition, current debunking techniques often depend on external verification of facts, which can be difficult with the newest stories. Often, by the time a story is proven a fake, the damage has already been done.
Linguistic analysis takes a different approach, analyzing quantifiable attributes like grammatical structure, word choice, punctuation and complexity. It works faster than humans and it can be used with a variety of different news types.
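As a rough illustration of what "quantifiable attributes" can mean in practice, the sketch below computes a handful of surface features from a story's text. The function name `linguistic_features` and the particular features chosen are assumptions made for this example; the article does not list the features the Michigan system actually uses.

```python
import re
import string


def linguistic_features(text: str) -> dict[str, float]:
    """Compute a few simple, countable style features (illustrative only)."""
    words = re.findall(r"[A-Za-z']+", text)
    # Split on runs of sentence-ending punctuation and drop empty pieces.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = len(words) or 1          # avoid division by zero on empty input
    n_sentences = len(sentences) or 1
    punctuation = sum(1 for ch in text if ch in string.punctuation)
    return {
        # Word choice: longer words and a richer vocabulary suggest complexity.
        "avg_word_length": sum(len(w) for w in words) / n_words,
        "type_token_ratio": len({w.lower() for w in words}) / n_words,
        # Grammatical structure, very crudely: words per sentence.
        "avg_sentence_length": len(words) / n_sentences,
        # Punctuation habits.
        "punctuation_per_word": punctuation / n_words,
        "exclamations": float(text.count("!")),
    }


features = linguistic_features("Wow! This is a test. Is it?")
```

Features like these are cheap to compute for every incoming story, which is one reason this kind of analysis can run faster than human review.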
“You can imagine any number of applications for this on the front or back end of a news or social media site,” Mihalcea said. “It could provide users with an estimate of the trustworthiness of individual stories or a whole news site. Or it could be a first line of defense on the back end of a news site, flagging suspicious stories for further review. A 76 percent success rate leaves a fairly large margin of error, but it can still provide valuable insight when it’s used alongside humans.”
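The "first line of defense" idea Mihalcea describes could look something like the sketch below, in which stories scoring above a cutoff are queued for a human editor. The `Story` class, the `fake_probability` field, and the 0.5 threshold are all hypothetical and stand in for whatever score a real detector would produce.

```python
from dataclasses import dataclass


@dataclass
class Story:
    title: str
    fake_probability: float  # hypothetical detector score in [0, 1]


def flag_for_review(stories: list[Story], threshold: float = 0.5) -> list[Story]:
    """Return stories at or above the threshold, most suspicious first."""
    flagged = [s for s in stories if s.fake_probability >= threshold]
    return sorted(flagged, key=lambda s: s.fake_probability, reverse=True)


queue = flag_for_review([
    Story("Council approves budget", 0.12),
    Story("Miracle cure stuns doctors", 0.91),
    Story("Bridge to reopen next month", 0.55),
])
```

Because the detector is wrong a meaningful share of the time, the output here is a review queue for editors rather than an automatic takedown list.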
Linguistic algorithms that analyze written speech are fairly common today, Mihalcea said. The challenge to building a fake news detector lies not in building the algorithm itself, but in finding the right data with which to train that algorithm.
Fake news appears and disappears quickly, which makes it difficult to collect. It also comes in many genres, further complicating the collection process. Satirical news, for example, is easy to collect, but its use of irony and absurdity makes it less useful for training an algorithm to detect fake news that’s meant to mislead.
Ultimately, Mihalcea’s team created its own data, crowdsourcing an online team that reverse-engineered verified genuine news stories into fakes. This is how most actual fake news is created, Mihalcea said: by individuals who quickly write it in return for a monetary reward.
Study participants, recruited with the help of Amazon Mechanical Turk, were paid to turn short, actual news stories into similar but fake news items, mimicking the journalistic style of the articles. At the end of the process, the research team had a dataset of 500 real and fake news stories.
They then fed these labeled pairs of stories to an algorithm that performed a linguistic analysis, teaching itself to distinguish between real and fake news. Finally, the team turned the algorithm on a dataset of real and fake news pulled directly from the web, netting the 76 percent success rate.
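The train-then-evaluate workflow described above can be sketched with a small stand-in model. The example below uses a multinomial naive Bayes classifier over word counts, chosen only because it fits in a few lines; the article does not say which learning algorithm the team used, and the four training sentences are invented for illustration, not drawn from the team's dataset.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (a stand-in model)."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set().union(*self.word_counts.values())
        return self

    def predict(self, text):
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for c in self.classes:
            total_words = sum(self.word_counts[c].values())
            score = math.log(self.doc_counts[c] / total_docs)  # class prior
            for w in tokenize(text):
                if w in self.vocab:  # ignore words never seen in training
                    count = self.word_counts[c][w] + 1
                    score += math.log(count / (total_words + len(self.vocab)))
            scores[c] = score
        return max(scores, key=scores.get)


def accuracy(model, texts, labels):
    """Fraction of stories whose predicted label matches the true label."""
    correct = sum(model.predict(t) == y for t, y in zip(texts, labels))
    return correct / len(labels)


# Invented toy examples standing in for labeled real and fake stories.
train_texts = [
    "The city council approved the budget on Tuesday.",
    "Officials said the bridge will reopen next month.",
    "Shocking! You won't believe what the council secretly did!",
    "Unbelievable miracle cure stuns doctors everywhere!",
]
train_labels = ["real", "real", "fake", "fake"]

model = NaiveBayes().fit(train_texts, train_labels)
prediction = model.predict("Shocking miracle stuns officials everywhere")
```

A success rate like the 76 percent reported in the study corresponds to calling `accuracy` on stories the model never saw during training, which is why the team evaluated on a separate set pulled from the web.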
The details of the new system and the dataset that the team used to build it are freely available, and Mihalcea says they could be used by news sites or other entities to build their own fake news detection systems. She says that future systems could be further honed by incorporating metadata such as the links and comments associated with a given online news item.