FREQUENT visitors to the Hustler Club, a gentlemen’s entertainment venue in New York, could not have known that they would become part of a debate about anonymity in the era of “big data”.
But when, for sport, a data scientist called Anthony Tockar mined a database of taxi-ride details to see what fell out of it, it became clear that, even though the data concerned included no direct identification of the customer, there were some intriguingly clustered drop-off points at private addresses for journeys that began at the club. Stir voter-registration records into the mix to identify who lives at those addresses (which Mr Tockar did not do) and you might end up creating some rather unhappy marriages.
The anonymisation of a data record typically means the removal from it of personally identifiable information. Names, obviously. But also phone numbers, addresses and various intimate details like dates of birth. Such a record is then deemed safe for release to researchers, and even to the public, to make of it what they will. Many people volunteer information, for example to medical trials, on the understanding that this will happen.
But the ability to compare databases threatens to make a mockery of such protections. Participants in genomics projects, promised anonymity in exchange for their DNA, have been identified by simple comparison with electoral rolls and other publicly available information. The health records of a governor of Massachusetts were plucked from a database, again supposedly anonymous, of state-employee hospital visits using the same trick. Reporters sifting through a public database of web searches were able to correlate them in order to track down one, rather embarrassed, woman who had been idly searching for single men. And so on.
Each of these headline-generating stories creates a demand for more controls. But that, in turn, deals a blow to the idea of open data—that the electronic “data exhaust” people exhale more or less every time they do anything in the modern world is actually useful stuff which, were it freely available for analysis, might make that world a better place.
Of cake, and eating it
Modern cars, for example, record in their computers much about how, when and where the vehicle has been used. Comparing the records of many vehicles, says Viktor Mayer-Schönberger of the Oxford Internet Institute, could provide a solid basis for, say, spotting dangerous stretches of road. Similarly, an opening of health records, particularly in a country like Britain, which has a national health service, and cross-fertilising them with other personal data, might help reveal the multifarious causes of diseases like Alzheimer’s.
This is a true dilemma. People want both perfect privacy and all the benefits of openness. But they cannot have both. The stripping of a few details as the only means of assuring anonymity, in a world choked with data exhaust, cannot work. Poorly anonymised data are only part of the problem. What may be worse is that there is no standard for anonymisation. Every American state, for example, has its own prescription for what constitutes an adequate standard.
All these approaches, though, are anathema to the open-data movement, because they limit the scope of studies. “If we’re making it so hard to share that only a few have access,” says Tim Althoff, a data scientist at Stanford University, “that has profound implications for science, for people being able to replicate and advance your work.”
The Latest on: Data Privacy
via Google News
The Latest on: Data Privacy
- 5 Data Hurdles in Real-Time Customer Experience Managementon February 3, 2020 at 1:51 pm
data governance, security, and compliance to consumer privacy regulation. That’s precisely why Adobe is partnering with leading academia to uncover solutions to these challenges, and help companies ...
- The Personal Data You Enter Into Period Trackers And Other Health Apps May Not Be As Private As You Thinkon February 3, 2020 at 1:40 pm
You have come to expect privacy issues with social media platforms such as Facebook. Data breaches have become commonplace. But how much should you worry about the apps you use to track highly ...
- Can privacy be big business? A wave of startups thinks so.on February 3, 2020 at 12:38 pm
California helped create the modern Big Data industry, in which tech companies vacuum up and profit off personal information. Now a new law in the state is creating something like a solution to the ...
- After Big Tech, can Big Privacy be the next big thing?on February 3, 2020 at 12:30 pm
Privacy-focused technology companies are offering a variety of services, from personal data scrubbing to business-focused software meant to help companies comply with the law.
- Google Stuck Between Privacy, Antitrust With Ad Data Limitson February 3, 2020 at 11:04 am
Google is limiting access to key tools that track ad spending, disrupting hundreds of marketers and underscoring the powerful role the search giant plays in the digital advertising industry. One ...
- Consumer Data Privacy, Preferences and Permission: A Time of Reckoningon February 3, 2020 at 8:00 am
Before we saw the headlines, we noticed it: The fast-growing number of brands alerting us – or asking our permission – to collect our personal consumer data as we visited their site. It’s not always ...
- Enterprise hits and misses - Data Privacy Day gets breached, and IBM places its leadership beton February 3, 2020 at 3:41 am
Lead story - Data Privacy Day - CEOs weigh in on tech's responsibility, does it matter? MyPOV: So Data Privacy Day came and went. Do you know where your data is? Snark aside, I understand why Stuart ...
- Trading data privacy for healthy habitson February 3, 2020 at 1:00 am
I know I’d discover that I’d consented to letting all of my information — my data — be shared. But where I walk and drink coffee isn’t all that important. The fact is that Pacer is free. And if I want ...
- Correction: Europe-Data Privacy storyon January 31, 2020 at 1:49 pm
LONDON -- In a story January 14, 2020, about a Norwegian group making privacy complaints about some dating services, The Associated Press erroneously cited the source of a statement explaining some of ...
- What Is A Data Passport: Building Trust, Data Privacy And Security In The Cloudon January 30, 2020 at 10:06 pm
Keeping our data safe and our privacy protected has become critical elements of businesses today. As businesses are moving more and more of our precious data to the cloud, it is vital that we look at ...
via Bing News