Google Professional Data Engineer

Get started today

Ultimate access to all questions.

Explanation:

A re-identification risk analysis job using the Data Loss Prevention (DLP) service is the most efficient and accurate method for researchers to quantify the risk of re-identification through quasi-identifiers. While a custom machine learning model could theoretically estimate this risk, it would require significant time, expertise, and maintenance. Counting infotypes might offer some insight, but it lacks the precision of a DLP analysis. Notably, there is no specific 're-identification risk' infotype in DLP for direct application. For more details, refer to Google Cloud's blog on understanding re-identification risk.

Explanation:

Comments (0)

No comments yet.

A team of socio-economic researchers is analyzing documents where personally identifying information has been redacted. They are concerned about the potential re-identification of individuals through quasi-identifiers like age and postal code. What method can they use to quantify this risk?

Real Exam

Utilize a custom machine learning model designed to estimate the risk of re-identification.

2.1%

Count the occurrences of quasi-identifiers using Data Loss Prevention infotypes as a measure of risk.

17.0%

Conduct a re-identification risk analysis with the Data Loss Prevention service for accurate risk assessment.

76.6%

Apply a re-identification infotype to each document containing quasi-identifiers to evaluate risk levels.

4.3%