
Ultimate access to all questions.
As a Microsoft Fabric Analytics Engineer Associate, you are tasked with profiling a dataset of social media posts to understand public sentiment and identify key trends. The dataset contains millions of posts from various platforms, and your analysis must be scalable, cost-effective, and compliant with data privacy regulations. Which of the following approaches would BEST achieve a comprehensive analysis while addressing these constraints? Choose one option.
A
Perform sentiment analysis to classify posts as positive, negative, or neutral, and visualize the results using a pie chart, without considering data quality issues.
B
Calculate basic statistics such as mean and standard deviation for the length of posts and visualize the results using a histogram, ignoring the textual content of the posts.
C
Use natural language processing techniques to extract keywords and topics from the posts, perform sentiment analysis, and visualize the results using a word cloud and sentiment distribution charts, after ensuring the dataset is clean and compliant.
D
Perform a data quality assessment to identify missing values, duplicates, and inconsistencies in the dataset, without analyzing the content of the posts for sentiment or trends.