Data Anonymisation generally refers to the process of removing identifying information such that the remaining data does not identify any particular individual. This is an important step to render the resultant data, which is no longer personal data, suitable for use in research and data mining. Such data analytics can bring greater value to different aspects of our lives, from improving transportation and healthcare services to enhancing public safety.
“There is often greater value in
Mr Zack Bana,
aggregating data instead of looking at
specific data points.”
Co-Founder and Data Protection Officer
of Beacon Consulting
Data Anonymisation has numerous ways to go about data anonymisation. In the birthday poser, Cheryl says her birthday is a secret, but gives Albert and Bernard two separate sets of clues as to when it might be. Albert is told it could be any one of four months. Bernard is told it could be any one of 10 days, of which only two occur uniquely. This is an example of data reduction where some values are removed from a data set and, it is usually done because those values are not required.
Data Anonymisation of personal data is carried out to render the resultant data suitable for more uses than its original state would permit under data protection regimes. For example, Data Anonymisation may be used for research and data mining where personal identifiers in the data are unnecessary or undesired. Data Anonymisation could also be a protection measure against inadvertent disclosures and security breaches.
There are often conflicting needs for anonymity and data integrity. Stripping data of too many identifiers may not preserve the usefulness of the data, or might deny potential uses for the data. Data Anonymisation for specific purposes might not be useful for others because its functionality is reduced.
Also read: 7 Client Data Protection Tips to Keep Customers Safe
To manage re-identification risks, organisations should consider if the entities receiving the Data Anonymisation are likely to possess or have access to information that can inadvertently lead to re-identification. Personal knowledge is also an important factor in assessing re-identification risks, as the people who are close to an individual, such as an individual’s friends or relatives, will possess unique personal knowledge about the individual. Although this personal knowledge will make it easier for an individual to be identified from an Data Anonymisation by his or her friends than a stranger, it is unlikely to amount to high re-identification risks for the Data Anonymisation.
Data Anonymisation should also be properly safeguarded from unintended recipients, whether they are within or outside the organisation.
Besides Data Anonymisation, other practices that organisations can adopt to minimise the risks of reidentification include:
The following is a non-exhaustive list of commonly used Data Anonymisation techniques, and examples of how each technique can be used.
Data Anonymisation remains a key tenet in personal data protection, safeguarding individual identities while allowing organisations to use data to gain valuable insights in more ways than would have been permitted under data protection regimes. As businesses embrace big data and use data analytics to do more sense-making and predictive analysis to extract insights to serve customers better and find new growth opportunities, anonymisation with robust re-identification assessment and risk management will be important to allow firms to optimally extract value from data and, at the same time, safeguard personal data.
Also read: 9 Policies For Security Procedures Examples