
Ultimate access to all questions.
In the context of Exploratory Data Analysis (EDA), a data scientist is working on a project to predict housing prices based on various features such as location, size, and age of the property. The dataset includes numerical and categorical variables. The team is in the initial stages of understanding the data's structure and relationships. Which of the following best describes the primary purpose of generating a correlation matrix in this scenario? Choose one correct option.
A
To directly deploy the predictive model into production.
B
To perform feature engineering by creating new variables based on existing ones.
C
To identify and remove all missing values from the dataset.
D
To visualize and quantify the linear relationships between pairs of numerical variables in the dataset.