Google Professional Data Engineer

Google Professional Data Engineer

Get started today

Ultimate access to all questions.


You are planning to transition an existing on-premises Hadoop infrastructure to Cloud Dataproc. The current setup mainly utilizes Hive, and the data is stored in Optimized Row Columnar (ORC) format. All ORC files have already been transferred to a Cloud Storage bucket. In order to enhance performance, you need to duplicate certain data into the cluster’s local Hadoop Distributed File System (HDFS). What are two methods to begin working with Hive on Cloud Dataproc? (Choose two.)





Powered ByGPT-5