Google Professional Data Engineer

Google Professional Data Engineer

Get started today

Ultimate access to all questions.


You are employed at a shipping company that utilizes handheld scanners to read shipping labels. The company adheres to stringent data privacy standards, necessitating that scanners solely transmit tracking numbers when events are sent to Kafka topics. However, a recent software update has caused these scanners to mistakenly transmit recipients' personally identifiable information (PII) to analytics systems, thereby breaching user privacy rules. Your objective is to swiftly develop a scalable solution using cloud-native managed services to prevent the exposure of PII to the analytics systems. What actions should you take?




Explanation:

The correct answer is D. Building a Cloud Function that reads the Kafka topics and makes a call to the Cloud Data Loss Prevention (Cloud DLP) API is the best approach. This solution uses a managed cloud-native service that can quickly identify and handle PII in real-time. The Cloud DLP API allows for tagging data with confidence levels, enabling you to decide whether to pass the data through or quarantine it for further review, thus effectively preventing the exposure of PII to analytics systems.