
In a machine learning project, you need to visualize how a numerical variable is distributed across several categories in order to identify potential outliers and understand the underlying probability density. The visualization should give a comprehensive overview that includes the median, quartiles, and density information. Which of the following plots would be the most effective for this purpose? Choose the one correct option.
A
Line plot, which is best for displaying trends over time or continuous data.
B
Scatter plot, ideal for examining the relationship between two continuous variables.
C
Violin plot, which merges the features of a box plot and a kernel density plot, offering a detailed view of how numerical data is distributed across different categories.
D
Box plot, which is useful for comparing distributions across categories but lacks the detailed density information that a violin plot provides.
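
As a rough illustration of how options C and D differ in practice, the sketch below draws a box plot and a violin plot of the same data side by side using matplotlib. The three categories and their synthetic values are assumptions made up for this example and are not part of the question; the quartile bars on the violins are added by hand, because matplotlib's `violinplot` draws only the density shape, extrema, and (optionally) the median by default.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script also runs headless
import matplotlib.pyplot as plt
import numpy as np

# Hypothetical data: three categories with differently shaped distributions.
rng = np.random.default_rng(0)
groups = {
    "unimodal": rng.normal(0.0, 1.0, 300),
    "bimodal": np.concatenate(
        [rng.normal(-2.0, 0.5, 150), rng.normal(2.0, 0.5, 150)]
    ),
    "skewed": rng.exponential(1.0, 300),
}
labels = list(groups)
data = [groups[name] for name in labels]
positions = np.arange(1, len(data) + 1)

fig, (ax_box, ax_violin) = plt.subplots(1, 2, figsize=(10, 4), sharey=True)

# Box plot: median, quartiles, whiskers, and outlier points per category.
ax_box.boxplot(data, positions=positions)
ax_box.set_title("Box plot")

# Violin plot: a kernel density estimate per category, plus median and extrema.
parts = ax_violin.violinplot(
    data, positions=positions, showmedians=True, showextrema=True
)

# Overlay the interquartile range as a thick bar on each violin.
quartiles = [np.percentile(values, [25, 50, 75]) for values in data]
for pos, (q1, _median, q3) in zip(positions, quartiles):
    ax_violin.vlines(pos, q1, q3, linewidth=4)
ax_violin.set_title("Violin plot")

for ax in (ax_box, ax_violin):
    ax.set_xticks(positions)
    ax.set_xticklabels(labels)

fig.tight_layout()
fig.savefig("box_vs_violin.png")
```

In the saved figure, the "bimodal" category should appear as a single wide box in the left panel, while the violin in the right panel shows two separate bulges, which is the density detail a box plot alone does not convey.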