Databricks Certified Data Engineer - Professional

Ultimate access to all questions.

In the context of Spark Structured Streaming, you are tasked with implementing a stream-static join to enrich streaming data with static reference data. The solution must handle late data efficiently without compromising the accuracy of the join results. Considering the need for scalability and the handling of out-of-order events, which of the following approaches is the BEST to implement a stream-static join? Choose the correct option from the four provided.

Simulated

Use the 'join' function to join the streaming DataFrame with the static DataFrame without any additional configurations.

23.3%

Use the 'join' function with a watermark to handle late data and specify the threshold for out-of-order events.

Loading comments...