Ultimate access to all questions.
In the context of preparing a large dataset for machine learning, you are tasked with reducing its dimensionality to improve model performance and reduce computational costs. The dataset contains hundreds of features, some of which are highly correlated. Given the constraints of needing to preserve as much of the original variability as possible and the requirement to facilitate easier data visualization, which of the following techniques would be the MOST appropriate to achieve these goals? Choose one correct option.
Explanation:
Correct Option: C. Principal Component Analysis (PCA)
Explanation:
Principal Component Analysis (PCA) is specifically designed for dimensionality reduction. It works by transforming the original variables into a new set of variables, the principal components, which are orthogonal (uncorrelated) and capture the maximum variance in the data. This makes PCA the most suitable option for the given scenario because:
Why other options are incorrect: