
Ultimate access to all questions.
In the context of preparing a dataset for machine learning, you are conducting Exploratory Data Analysis (EDA) and encounter missing values across several features. The dataset is large, and the missing values are randomly distributed. Cost efficiency and scalability are key considerations for your project. Which of the following methods is MOST appropriate for addressing missing data under these constraints? Choose the BEST option.
A
Data encryption to secure the dataset before further processing
B
Data scaling to normalize the range of all features
C
Imputation using the mean or median for numerical features and mode for categorical features
D
Data normalization to adjust the scale of features without considering missing values
E
Deleting rows or columns with missing values to ensure data completeness