
Explanation:
Correct Answer: D. Data modeling
Explanation:
Data modeling is a critical phase in the machine learning data preparation process. It focuses on defining the structure, relationships, and constraints of the data to ensure it is well-organized and primed for analysis. This phase includes creating entity-relationship diagrams (ERDs) to visualize data relationships, designing data schemas for logical and physical database structure, and applying data normalization techniques to minimize redundancy and enhance data integrity.
Incorrect Options:
Ultimate access to all questions.
No comments yet.
In the context of preparing data for machine learning, a team is tasked with ensuring that the data is well-organized, relationships among data entities are clearly defined, and the data is primed for analysis. This involves creating entity-relationship diagrams, designing data schemas, and applying data normalization techniques. Which phase of the machine learning data preparation process is primarily responsible for these tasks? Choose the best option.
A
Data collection, where raw data is gathered from various sources without any structuring.
B
Data pre-processing, which involves cleaning and transforming data to improve its quality and usability.
C
Data integration, focusing on combining data from multiple sources into a unified dataset.
D
Data modeling, dedicated to establishing the data schema, structures, and relationships to ensure the data is well-organized and ready for analysis.