FREQUENT visitors to the Hustler Club, a gentlemen’s entertainment venue in New York, could not have known that they would become part of a debate about anonymity in the era of “big data”.
But when, for sport, a data scientist called Anthony Tockar mined a database of taxi-ride details to see what fell out of it, it became clear that, even though the data concerned included no direct identification of the customer, there were some intriguingly clustered drop-off points at private addresses for journeys that began at the club. Stir voter-registration records into the mix to identify who lives at those addresses (which Mr Tockar did not do) and you might end up creating some rather unhappy marriages.
The anonymisation of a data record typically means the removal from it of personally identifiable information. Names, obviously. But also phone numbers, addresses and various intimate details like dates of birth. Such a record is then deemed safe for release to researchers, and even to the public, to make of it what they will. Many people volunteer information, for example to medical trials, on the understanding that this will happen.
But the ability to compare databases threatens to make a mockery of such protections. Participants in genomics projects, promised anonymity in exchange for their DNA, have been identified by simple comparison with electoral rolls and other publicly available information. The health records of a governor of Massachusetts were plucked from a database, again supposedly anonymous, of state-employee hospital visits using the same trick. Reporters sifting through a public database of web searches were able to correlate them in order to track down one, rather embarrassed, woman who had been idly searching for single men. And so on.
Each of these headline-generating stories creates a demand for more controls. But that, in turn, deals a blow to the idea of open data—that the electronic “data exhaust” people exhale more or less every time they do anything in the modern world is actually useful stuff which, were it freely available for analysis, might make that world a better place.
Of cake, and eating it
Modern cars, for example, record in their computers much about how, when and where the vehicle has been used. Comparing the records of many vehicles, says Viktor Mayer-Schönberger of the Oxford Internet Institute, could provide a solid basis for, say, spotting dangerous stretches of road. Similarly, an opening of health records, particularly in a country like Britain, which has a national health service, and cross-fertilising them with other personal data, might help reveal the multifarious causes of diseases like Alzheimer’s.
This is a true dilemma. People want both perfect privacy and all the benefits of openness. But they cannot have both. The stripping of a few details as the only means of assuring anonymity, in a world choked with data exhaust, cannot work. Poorly anonymised data are only part of the problem. What may be worse is that there is no standard for anonymisation. Every American state, for example, has its own prescription for what constitutes an adequate standard.
All these approaches, though, are anathema to the open-data movement, because they limit the scope of studies. “If we’re making it so hard to share that only a few have access,” says Tim Althoff, a data scientist at Stanford University, “that has profound implications for science, for people being able to replicate and advance your work.”
The Latest on: Data Privacy
via Google News
The Latest on: Data Privacy
- Senators propose COVID-19 contact-tracing privacy billon June 1, 2020 at 5:23 pm
The bipartisan effort aims to protect users as technology is used to trace the spread of the novel coronavirus.
- The shape of future data privacy regulation is immaterialon June 1, 2020 at 2:50 pm
A full disclosure of data practices, along with enhanced controls that put consumers in the data privacy driving seat, are vital to winning trust and loyalty.
- As Apple and Google begin to roll out their contact tracing tech, a new bill could enforce strict rules to protect user dataon June 1, 2020 at 1:38 pm
A new bill is being drawn up to create rules over how users' private information is managed in contact tracing apps.
- New DIFC law seeks to enhance security, privacy of dataon June 1, 2020 at 1:12 pm
General fines have been introduced for serious breaches of the law, in addition to or instead of administrative fines.
- Members of Congress to unveil bipartisan bill to regulate contact-tracing apps, fearing potential privacy abuseson June 1, 2020 at 11:40 am
Senate lawmakers plan to unveil a bipartisan bill on Monday that would regulate contact-tracing and exposure-notification apps, seeking to ensure new digital tools meant to combat the coronavirus ...
- Bloomberg Law Leadership Forum Convening Privacy and Data Security Experts to Tackle Pressing Compliance and Regulatory Issueson June 1, 2020 at 7:00 am
Bloomberg Law today announced that its Bloomberg Law Leadership Forum will take place over three days and bring together legal industry ...
- GlobeX Data Launches Social Media Influencers Affiliate Program for Cybersecurity and Data Privacy Solutionson June 1, 2020 at 6:30 am
TORONTO, ON / ACCESSWIRE / June 1 2020 / GlobeX Data Ltd. (OTCQB:SWISF) (CSE:SWIS) (“GlobeX” or the “Company”), the leader in Swiss hosted cyber security and Internet privacy solutions for secure data ...
- Balancing Privacy Concerns Around Facial Recognitionon June 1, 2020 at 6:12 am
There has been recent global uproar around facial recognition technology and whether it’s ethically sound. Its use without citizen consent could have potential safety benefits but is undoubtedly a ...
- Why Contact Tracing Apps Will Be The Biggest Test Yet Of Data Privacy Versus Public Safetyon May 31, 2020 at 9:29 pm
One of the key requirements for countries to come out of the Coronavirus lockdown and resume a more normal life, is to have good track and trace systems in place. But are the data privacy implications ...
- Google’s federated analytics method could analyze end user data without invading privacyon May 27, 2020 at 11:56 am
Google's federated analytics techniques, which power features like Now Playing, could be used to analyze end user data without invading privacy.
via Bing News