
Ultimate access to all questions.
In the design of a data system for a large-scale e-commerce platform, the team is tasked with ensuring that the system can handle high volumes of transactions while maintaining data integrity and supporting complex queries for analytics. Which phase of the data system design is primarily dedicated to outlining the data schema, structures, and the relationships between them to meet these requirements? Choose one correct option.
A
Data collection, which involves gathering raw data from various sources without considering the structure or relationships.
B
Data integration, which focuses on merging data from different sources into a unified dataset but does not define the schema or relationships.
C
Data modeling, which is crucial for defining how data is structured, related, and constrained within a database or data warehouse to support scalability and complex queries.
D
Data pre-processing, which is about cleaning and transforming data for analysis but does not involve defining the data schema or relationships.