Google Professional Data Engineer

Google Professional Data Engineer

Get started today

Ultimate access to all questions.


You are tasked with ensuring that additional dependencies are deployed to all nodes of a Cloud Dataproc cluster during startup. This process must utilize an existing initialization action script. However, your company's security policies mandate that Cloud Dataproc nodes must operate without internet access, thus preventing public initialization actions from being able to fetch resources online. What steps should you take to accomplish this deployment?




Explanation:

The correct answer is C. Given that the company's security policies do not allow Cloud Dataproc nodes to access the Internet, the best solution is to place the additional dependencies in a Cloud Storage bucket within the VPC security perimeter. This way, the dependencies can be accessed from the internal network without exposing the Dataproc nodes to the public Internet. Enabling Private Google Access ensures that the cluster nodes can download the dependencies from Cloud Storage using internal IP addresses.