
Explanation:
A re-identification risk analysis job using the Data Loss Prevention (DLP) service is the most efficient and accurate method for researchers to quantify the risk of re-identification through quasi-identifiers. While a custom machine learning model could theoretically estimate this risk, it would require significant time, expertise, and maintenance. Counting infotypes might offer some insight, but it lacks the precision of a DLP analysis. Notably, there is no specific 're-identification risk' infotype in DLP for direct application. For more details, refer to Google Cloud's blog on understanding re-identification risk.
Ultimate access to all questions.
No comments yet.
A team of socio-economic researchers is analyzing documents where personally identifying information has been redacted. They are concerned about the potential re-identification of individuals through quasi-identifiers like age and postal code. What method can they use to quantify this risk?
A
Utilize a custom machine learning model designed to estimate the risk of re-identification.
B
Count the occurrences of quasi-identifiers using Data Loss Prevention infotypes as a measure of risk.
C
Conduct a re-identification risk analysis with the Data Loss Prevention service for accurate risk assessment.
D
Apply a re-identification infotype to each document containing quasi-identifiers to evaluate risk levels.