Machine ‘Unlearning’ Technique Wipes Out Unwanted Data Quickly and Completely

The novel approach to making systems forget data is called "machine unlearning" by the two researchers who are pioneering the concept. Instead of making a model directly depend on each training data sample (left), they convert the learning algorithm into a summation form (right) — a process that is much easier and faster than retraining the system from scratch. Courtesy of Yinzhi Cao and Junfeng Yang
The novel approach to making systems forget data is called “machine unlearning” by the two researchers who are pioneering the concept. Instead of making a model directly depend on each training data sample (left), they convert the learning algorithm into a summation form (right) — a process that is much easier and faster than retraining the system from scratch. Courtesy of Yinzhi Cao and Junfeng Yang
Machine learning systems are everywhere. Computer software in these machines predicts the weather, forecasts earthquakes, provides recommendations based on the books and movies we like and, even, applies the brakes on our cars when we are not paying attention.

To do this, computer systems are programmed to find predictive relationships calculated from the massive amounts of data we supply to them. Machine learning systems use advanced algorithms — a set of rules for solving math problems — to identify these predictive relationships using “training data.” This data is then used to construct the models and features within a system that enables it to correctly predict your desire to read the latest best-seller, or the likelihood of rain next week.

This intricate learning process means that a piece of raw data often goes through a series of computations in a given system. The data, computations and information derived by the system from that data together form a complex propagation network called the data’s “lineage.” The term was coined by researchers Yinzhi Cao of Lehigh University and Junfeng Yang of Columbia University who are pioneering a novel approach toward making such learning systems forget.

Considering how important this concept is to increasing security and protecting privacy, Cao and Yang believe that easy adoption of forgetting systems will be increasingly in demand. The pair has developed a way to do it faster and more effectively than what is currently available.

Their concept, called “machine unlearning,” is so promising that the duo have been awarded a four-year, $1.2 million National Science Foundation grant — split between Lehigh and Columbia — to develop the approach.

“Effective forgetting systems must be able to let users specify the data to forget with different levels of granularity,” said Yinzhi Cao, Assistant Professor of Computer Science and Engineering at Lehigh University’s P.C. Rossin College of Engineering & Applied Science and a Principal Investigator on the project. “These systems must remove the data and undo its effects so that all future operations run as if the data never existed.”

There are a number of reasons why an individual user or service provider might want a system to forget data and its complete lineage. Privacy is one.

After Facebook changed its privacy policy, many users deleted their accounts and the associated data. The iCloud photo hacking incident in 2014 — in which hundreds of celebrities’ private photos were accessed via Apple’s cloud services suite — led to online articles teaching users how to completely delete iOS photos including the backups. New research has revealed that machine learning models for personalized medicine dosing leak patients’ genetic markers. Only a small set of statistics on genetics and diseases are enough for hackers to identify specific individuals, despite cloaking mechanism.

Naturally, users unhappy with these newfound risks want their data and its influence on the models and statistics to be completely forgotten.

Security is another reason.

Learn more: Machine ‘Unlearning’ Technique Wipes Out Unwanted Data Quickly and Completely

 

See Also

 

The Latest on: Machine unlearning

[google_news title=”” keyword=”machine unlearning” num_posts=”10″ blurb_length=”0″ show_thumb=”left”]

via Google News

 

The Latest on: Machine unlearning
  • Why Macklemore’s new song “Hind’s Hall” is one of the most important protest songs in history
    on May 7, 2024 at 1:24 pm

    Macklemore has dropped a new song and video titled “Hind’s Hall.” The protest song supporting the people of Palestine is a no-holds-barred proclamation calling out everyone from President Biden and ...

  • OpenAI Offers an Olive Branch to Artists Wary of Feeding AI Algorithms
    on May 7, 2024 at 12:24 pm

    Research is underway on machine “unlearning,” a process that adjusts an AI system to retrospectively remove the contribution of one part of its training data, but the technique has not yet been ...

  • Macklemore releases pro-Palestine track Hind’s Hall as he hits out at Drake vs Kendrick Lamar beef
    on May 6, 2024 at 11:56 pm

    Macklemore has stunned fans by releasing a blistering new protest track in support of Palestine, with musician Tom Morello branding it “the most Rage Against the Machine song since ... with ‘em/ ...

  • Macklemore releases pro-Palestine track Hind’s Hall
    on May 6, 2024 at 11:24 pm

    Macklemore has stunned fans by releasing a blistering new protest track in support of Palestine, with musician Tom Morello branding it “the most Rage Against the Machine song since ... with ‘em/ ...

  • fruit machine
    on May 5, 2024 at 5:00 pm

    Who wants to push a button on a slot machine, anyway? Might as well just play video poker. [John Bradnam] seems to agree, and has built an open-source three-color matrix slot machine complete with ...

  • Machine learning - statistics & facts
    on May 2, 2024 at 5:00 pm

    What kind of AI is machine learning? Of the forms of AI used, machine learning is the simplest, but that makes it also one of the most useful. Other AI subsets include deep learning, neural ...

  • Which washing machine brand is the most reliable?
    on May 1, 2024 at 5:00 pm

    We investigated the performance and reliability of some of the most popular washing machine brands including Bosch, Hotpoint and Miele. In our unique large-appliance survey, we ask more than 7,000 ...

  • Best virtual machine software of 2024
    on April 22, 2024 at 12:37 am

    However, virtual machine software is also available to home users as well. A key advantage of running a virtual machine is that it allows you to run apps that would otherwise not be available due ...

  • Wayback Machine: 5 Alternatives To Try
    on April 17, 2024 at 5:00 pm

    But there’s good news! There are web archives, like the Wayback Machine, that take “snapshots” of websites at different times. This means they save a copy of the website’s appearance on a ...

  • Best espresso machines 2024
    on April 11, 2024 at 7:41 am

    Apart from delivering a smooth drink, the machine needs to be easy to use, with an intuitive control panel. You’ll also want it to be quick to brew and quiet in operation. To help you decide wha ...

via  Bing News

 

What's Your Reaction?
Don't Like it!
0
I Like it!
0
Scroll To Top