FREQUENT visitors to the Hustler Club, a gentlemen’s entertainment venue in New York, could not have known that they would become part of a debate about anonymity in the era of “big data”.
But when, for sport, a data scientist called Anthony Tockar mined a database of taxi-ride details to see what fell out of it, it became clear that, even though the data concerned included no direct identification of the customer, there were some intriguingly clustered drop-off points at private addresses for journeys that began at the club. Stir voter-registration records into the mix to identify who lives at those addresses (which Mr Tockar did not do) and you might end up creating some rather unhappy marriages.
The anonymisation of a data record typically means the removal from it of personally identifiable information. Names, obviously. But also phone numbers, addresses and various intimate details like dates of birth. Such a record is then deemed safe for release to researchers, and even to the public, to make of it what they will. Many people volunteer information, for example to medical trials, on the understanding that this will happen.
But the ability to compare databases threatens to make a mockery of such protections. Participants in genomics projects, promised anonymity in exchange for their DNA, have been identified by simple comparison with electoral rolls and other publicly available information. The health records of a governor of Massachusetts were plucked from a database, again supposedly anonymous, of state-employee hospital visits using the same trick. Reporters sifting through a public database of web searches were able to correlate them in order to track down one, rather embarrassed, woman who had been idly searching for single men. And so on.
Each of these headline-generating stories creates a demand for more controls. But that, in turn, deals a blow to the idea of open data—that the electronic “data exhaust” people exhale more or less every time they do anything in the modern world is actually useful stuff which, were it freely available for analysis, might make that world a better place.
Of cake, and eating it
Modern cars, for example, record in their computers much about how, when and where the vehicle has been used. Comparing the records of many vehicles, says Viktor Mayer-Schönberger of the Oxford Internet Institute, could provide a solid basis for, say, spotting dangerous stretches of road. Similarly, an opening of health records, particularly in a country like Britain, which has a national health service, and cross-fertilising them with other personal data, might help reveal the multifarious causes of diseases like Alzheimer’s.
This is a true dilemma. People want both perfect privacy and all the benefits of openness. But they cannot have both. The stripping of a few details as the only means of assuring anonymity, in a world choked with data exhaust, cannot work. Poorly anonymised data are only part of the problem. What may be worse is that there is no standard for anonymisation. Every American state, for example, has its own prescription for what constitutes an adequate standard.
All these approaches, though, are anathema to the open-data movement, because they limit the scope of studies. “If we’re making it so hard to share that only a few have access,” says Tim Althoff, a data scientist at Stanford University, “that has profound implications for science, for people being able to replicate and advance your work.”
The Latest on: Data Privacy
via Google News
The Latest on: Data Privacy
- The Birth Of The Data Science Generationon October 14, 2019 at 4:22 am
Data scientists will build one model that adapts itself in minutes to serve a wide variety of business structures and enterprise functions. But as the past decade has illustrated, our quest for ...
- iOS 13 Safari’s Safe Browsing reportedly sending some data to Tencenton October 14, 2019 at 4:11 am
That information is only available via Safari’s Privacy Terms, which you’ll only know about if you go looking for it. It also doesn’t make it clear which data is sent to which provider in which ...
- Bracing for sweeping new data privacy lawon October 14, 2019 at 4:01 am
“In the two years since introducing the legislation that passed CCPA, which gives nearly 40 million people in this state the strongest data privacy rights in the country, I’ve realized the immense ...
- Data privacy is more important than product quality for consumerson October 14, 2019 at 4:00 am
and a company with great data protection practices but not that great of a product, most consumers would choose the latter. This is according to a new IBM Privacy study which analysed businesses’ data ...
- Apple Accused Of Sending Data From 1 Billion+ iPhones And iPads To Chinaon October 14, 2019 at 3:57 am
Apple also uses Google’s equivalent safe browsing service, and let’s face it that’s a company with its own questionable track record on data privacy. There’s little technical substance here—the ...
- PwC Qatar, QFC lay out approaches on data privacy in Mideast regionon October 14, 2019 at 12:54 am
DOHA: Interest in data privacy in the Middle East and globally continues to grow at a rapid rate. PwC Qatar in collaboration with the Qatar Financial Centre held a seminar on 'Beyond GDPR: Data ...
- This AI-powered Data Intelligence Provider Gained From Tighter Privacy Regulationson October 11, 2019 at 7:20 am
Singapore-based start-up Near, which provides AI-based data analytics services to firms across the world, has gained from the new regulations to govern online data privacy in Europe, Anil Mathews, the ...
- Nevada latest state to pass data privacy lawon October 10, 2019 at 8:00 pm
Privacy lawyers and other legal experts gave the Nevada law mixed reviews, with some saying the legislation gives consumers more control over their data. Critics say the law is limited in scope and ...
- ‘Ignorance is not an excuse’: California draft rules on data privacy releasedon October 10, 2019 at 3:09 pm
California Attorney General Xavier Becerra released a series of draft regulations Thursday aimed at getting businesses to comply with the state’s landmark data privacy law, scheduled to take effect ...
- Maryland forms data privacy council to address student information issueson October 10, 2019 at 2:40 pm
After a recent state audit showed that Maryland needs to do a better job of protecting student's personal records, the state has formed a student data privacy council to help address this issue.
via Bing News