FREQUENT visitors to the Hustler Club, a gentlemen’s entertainment venue in New York, could not have known that they would become part of a debate about anonymity in the era of “big data”.
But when, for sport, a data scientist called Anthony Tockar mined a database of taxi-ride details to see what fell out of it, it became clear that, even though the data concerned included no direct identification of the customer, there were some intriguingly clustered drop-off points at private addresses for journeys that began at the club. Stir voter-registration records into the mix to identify who lives at those addresses (which Mr Tockar did not do) and you might end up creating some rather unhappy marriages.
The anonymisation of a data record typically means the removal from it of personally identifiable information. Names, obviously. But also phone numbers, addresses and various intimate details like dates of birth. Such a record is then deemed safe for release to researchers, and even to the public, to make of it what they will. Many people volunteer information, for example to medical trials, on the understanding that this will happen.
But the ability to compare databases threatens to make a mockery of such protections. Participants in genomics projects, promised anonymity in exchange for their DNA, have been identified by simple comparison with electoral rolls and other publicly available information. The health records of a governor of Massachusetts were plucked from a database, again supposedly anonymous, of state-employee hospital visits using the same trick. Reporters sifting through a public database of web searches were able to correlate them in order to track down one, rather embarrassed, woman who had been idly searching for single men. And so on.
Each of these headline-generating stories creates a demand for more controls. But that, in turn, deals a blow to the idea of open data—that the electronic “data exhaust” people exhale more or less every time they do anything in the modern world is actually useful stuff which, were it freely available for analysis, might make that world a better place.
Of cake, and eating it
Modern cars, for example, record in their computers much about how, when and where the vehicle has been used. Comparing the records of many vehicles, says Viktor Mayer-Schönberger of the Oxford Internet Institute, could provide a solid basis for, say, spotting dangerous stretches of road. Similarly, an opening of health records, particularly in a country like Britain, which has a national health service, and cross-fertilising them with other personal data, might help reveal the multifarious causes of diseases like Alzheimer’s.
This is a true dilemma. People want both perfect privacy and all the benefits of openness. But they cannot have both. The stripping of a few details as the only means of assuring anonymity, in a world choked with data exhaust, cannot work. Poorly anonymised data are only part of the problem. What may be worse is that there is no standard for anonymisation. Every American state, for example, has its own prescription for what constitutes an adequate standard.
All these approaches, though, are anathema to the open-data movement, because they limit the scope of studies. “If we’re making it so hard to share that only a few have access,” says Tim Althoff, a data scientist at Stanford University, “that has profound implications for science, for people being able to replicate and advance your work.”
The Latest on: Data Privacy
via Google News
The Latest on: Data Privacy
- Facebook will have to pay a record-breaking fine for violating users’ privacy. But the FTC wanted more.on July 22, 2019 at 3:48 pm
“Certainly a company like Facebook has the firepower to fight fire with fire and actually take the U.S. government all the way to the Supreme Court, maybe and once and for all settle the authority the ... […]
- Equifax to Pay at Least $650 Million in Largest-Ever Data Breach Settlementon July 22, 2019 at 3:17 pm
“Equifax put profits over privacy and greed over people ... “To date, we haven’t seen any instances of the data that was stolen being sold.” The current settlement figure of about $650 million is a ... […]
- Facebook Settlement Expected to Mandate Privacy Committeeon July 22, 2019 at 2:58 pm
Facebook declined to comment. Among the measures expected to be imposed on the social-media giant to prevent future consumer-data violations, the board committee would add to an internal privacy team ... […]
- Google privacy lawsuit: Tech giant to pay $13 million over Street View data collectionon July 22, 2019 at 1:30 pm
Google has agreed to pay a $13 million settlement that could resolve a class-action lawsuit over the company's collection of people's private information through its Street View project. The agreement ... […]
- Senate Tech Task Force Convenes, Focused on Data Privacy Concernson July 22, 2019 at 7:19 am
July 22, 2019 - The first Senate Judiciary Committee Tech Task Force convened last week, to begin discussions on how to handle technology issues across all sectors, including those with data ... […]
- Immuta Releases Guidelines for Privacy, Data Protection by Designon July 22, 2019 at 6:36 am
Immuta Experts Lay Foundation for Creating Workflows and Maintaining Compliance with Strict Mandates on Data Immuta, the automated data governance company, today released two important whitepapers ... […]
- TikTok parent to open India data centre in privacy reformon July 22, 2019 at 6:20 am
The Chinese parent company of popular video app TikTok will set up a data centre in India, as privacy concerns prompt Indian regulators to push for greater domestic storage of information. The app has ... […]
- Do data-privacy rules make cross-selling more difficult?on July 20, 2019 at 5:20 am
Data privacy rules are causing concern that cross-selling retail products and services to 401(k) plan participants may not always be kosher. Brokerage executives attending InvestmentNews' recent ... […]
- FaceApp and data privacy - the joke's on youon July 19, 2019 at 4:09 am
FaceApp is funny stuff, isn't it? Actually, no it isn't. Unless you’re on an online detox this week, it’s pretty much guaranteed that your social media timeline has been backed up with ‘hilarious’ ... […]
- Privitar Adds Enhanced Capabilities for Data Protection and Safe Data Analysis in the Latest Version of Its Data-Privacy Softwareon July 18, 2019 at 2:34 pm
Privitar, whose software delivers the uncompromised data privacy that is essential for organizations worldwide to conduct safe and ethical data analysis, today released version 3.0 of its ... […]
via Bing News