
Ultimate access to all questions.
You have set up an analytics environment on Google Cloud for your data scientist team, allowing them to explore data without affecting the on-premises Apache Hadoop solution. The data stored in the on-premises Hadoop Distributed File System (HDFS) cluster is formatted in Optimized Row Columnar (ORC) files and includes multiple columns of Hive partitioning. Your data scientist team requires the capability to explore this data using SQL on the Hive query engine, similar to how they do on the on-premises HDFS cluster. To meet this requirement, you need to determine the most cost-effective storage and processing solution. What should you do?
A
Import the ORC files to Bigtable tables for the data scientist team.
B
Import the ORC files to BigQuery tables for the data scientist team.
C
Copy the ORC files on Cloud Storage, then deploy a Dataproc cluster for the data scientist team.
D
Copy the ORC files on Cloud Storage, then create external BigQuery tables for the data scientist team.