
Explanation:
Changing the distribution style to ALL stores a full copy of the table on every compute node. This is recommended for small, slowly changing dimension tables (such as a store location table) because each node can join against its own local copy, so the table no longer has to be broadcast over the network during joins. Queries run faster and use less compute and network capacity, without adding or resizing nodes, which makes this the most cost-effective approach for this specific problem.
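As a rough illustration of the change described above, the sketch below builds the Redshift statements involved. The table name store_location and both helper functions are hypothetical and used only for this example. The statements rely on the ALTER TABLE ... ALTER DISTSTYLE ALL form and the SVV_TABLE_INFO system view, and would need to be run through whatever SQL client or driver is already in use.

```python
def alter_diststyle_all_sql(table_name: str) -> str:
    """Build the statement that switches a table to ALL distribution."""
    # Accept only simple identifiers (letters, digits, underscores) so the
    # name can be placed into the statement safely. Schema-qualified names
    # are rejected by this simplified check.
    if not table_name.replace("_", "").isalnum():
        raise ValueError(f"unexpected table name: {table_name!r}")
    return f"ALTER TABLE {table_name} ALTER DISTSTYLE ALL;"


def diststyle_check_sql(table_name: str) -> str:
    """Build a query that reports the table's current distribution style."""
    if not table_name.replace("_", "").isalnum():
        raise ValueError(f"unexpected table name: {table_name!r}")
    return (
        'SELECT "table", diststyle FROM svv_table_info '
        f"WHERE \"table\" = '{table_name}';"
    )


if __name__ == "__main__":
    # store_location is an assumed name for the store location table.
    print(alter_diststyle_all_sql("store_location"))
    print(diststyle_check_sql("store_location"))
```

Running the check query before and after the change is one way to confirm that the table moved from EVEN to ALL distribution.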
A data engineer notices that Redshift queries are slowing down because the whole store location table is constantly being broadcast to all four compute nodes for most queries. The data engineer wants to speed up query performance by minimizing the broadcasting of the store location table. Which solution will meet these requirements in the MOST cost-effective way?
A
Change the distribution style of the store location table from EVEN distribution to ALL distribution.
B
Change the distribution style of the store location table to KEY distribution based on the column that has the highest dimension.
C
Add a join column named store_id into the sort key for all the tables.
D
Upgrade the Redshift reserved node to a larger instance size in the same instance family.