
Answer-first summary for fast verification
Answer: Gradient aggregation, where each node computes gradients locally and only the aggregated gradients are communicated, significantly reducing the amount of data transferred.
**Correct Option: C. Gradient Aggregation**

Gradient aggregation is the most effective technique for minimizing communication overhead in this distributed machine learning setting. Each node computes gradients on its local data shard, and only the aggregated gradients are exchanged over the network, so per-step traffic scales with the model size rather than the data size. Because raw data never leaves its node, the approach also supports real-time model updates under the stated network-bandwidth and data-privacy constraints.

**Why the other options are less effective:**

- **A. Data replication**: improves fault tolerance and data availability, but copying petabytes of data across multiple data centers increases, rather than reduces, communication overhead.
- **B. Model compression**: pruning and quantization shrink the model itself but do not directly address the communication generated during training.
- **D. Data sharding**: distributes the dataset across nodes but does not by itself reduce the communication overhead during model training.
- **E. None of the above**: incorrect, because gradient aggregation does address the requirement.
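The idea can be illustrated with a minimal sketch in plain Python. The function and variable names here are illustrative, not tied to any specific framework: two simulated workers each compute a mean-squared-error gradient for a linear model on their private data shard, and only the small aggregated gradient vector crosses the (simulated) network.

```python
# Minimal sketch of gradient aggregation for a linear model.
# All names here are illustrative; real systems would use a framework
# primitive such as an all-reduce instead of the aggregate() helper.

def local_gradient(weights, shard):
    """Each worker computes the MSE gradient on its own data shard."""
    grad = [0.0] * len(weights)
    for x, y in shard:
        pred = sum(w * xi for w, xi in zip(weights, x))
        err = pred - y
        for i, xi in enumerate(x):
            grad[i] += 2 * err * xi / len(shard)
    return grad

def aggregate(grads):
    """Average the per-worker gradients. Only these vectors are
    communicated; the raw data shards never leave their nodes."""
    n = len(grads)
    return [sum(g[i] for g in grads) / n for i in range(len(grads[0]))]

# Two workers, each holding a private shard of (features, label) pairs.
shards = [
    [([1.0, 0.0], 1.0), ([0.0, 1.0], 2.0)],
    [([1.0, 1.0], 3.0), ([2.0, 0.0], 2.0)],
]
weights = [0.0, 0.0]
lr = 0.1

grads = [local_gradient(weights, s) for s in shards]  # computed locally
step = aggregate(grads)                               # one small message per worker
weights = [w - lr * g for w, g in zip(weights, step)]
print(weights)
```

Note the communication pattern: each worker transmits one vector the size of the model per update step, regardless of how large its data shard is, which is exactly why the technique scales to petabyte-sized datasets.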
Author: LeetQuiz Editorial Team
In the context of optimizing a distributed machine learning system for a global e-commerce platform, where the system is required to process petabytes of data across multiple data centers with minimal latency and cost, which of the following techniques is MOST effective for minimizing communication overhead between nodes? Consider the need for real-time model updates and the constraints of network bandwidth and data privacy regulations. Choose the best option.
**A.** Data replication across all nodes to ensure high availability and fault tolerance, despite the potential increase in communication overhead.

**B.** Model compression techniques such as pruning and quantization to reduce the size of the model, without directly addressing the communication overhead.

**C.** Gradient aggregation, where each node computes gradients locally and only the aggregated gradients are communicated, significantly reducing the amount of data transferred.

**D.** Data sharding to distribute the dataset across nodes, which does not inherently reduce the communication overhead during model training.

**E.** None of the above options fully address the requirement for minimizing communication overhead while considering all given constraints.