
Answer-first summary for fast verification
Answer: Leverage Google Cloud's Data Catalog to perform a keyword-based search across all BigQuery datasets, utilizing the detailed table descriptions to quickly pinpoint the relevant table.
**Correct Answer: D. Leverage Google Cloud's Data Catalog to perform a keyword-based search across all BigQuery datasets, utilizing the detailed table descriptions to quickly pinpoint the relevant table.** Data Catalog is designed for metadata management and discovery, offering seamless integration with BigQuery. It enables efficient and scalable search capabilities across vast amounts of data, making it the optimal choice for quickly finding the right dataset in a large enterprise environment. **Why not the others?** - **Option A** is more about resource management post-discovery and does not aid in the initial data discovery process. - **Option B** adds unnecessary complexity and maintenance overhead, as Data Catalog already provides a more efficient and integrated solution. - **Option C** is less efficient, especially at scale, as it requires manual effort to parse through potentially thousands of table names without leveraging the descriptive metadata as effectively as Data Catalog.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
In your role as a Machine Learning Engineer at a large enterprise, you are tasked with developing a new AI model using AI Platform. The company has thousands of datasets stored in BigQuery, each with accurate and detailed descriptions. Given the scale of data and the need for efficiency, how would you most effectively identify the correct BigQuery table for your model's training data? Consider the following options and choose the best one. (Choose one correct option)
A
Implement a custom tagging system for each model and version on AI Platform, explicitly linking them to the corresponding BigQuery table names used for training.
B
Create and maintain a separate lookup table within BigQuery that manually maps table descriptions to their respective table IDs, then query this lookup table to find the needed table ID.
C
Run a comprehensive query in BigQuery utilizing the INFORMATION_SCHEMA metadata tables to list all table names in your project, then manually sift through the results to locate the desired table.
D
Leverage Google Cloud's Data Catalog to perform a keyword-based search across all BigQuery datasets, utilizing the detailed table descriptions to quickly pinpoint the relevant table.
No comments yet.