
Databricks Certified Data Engineer - Associate
Get started today
Ultimate access to all questions.
In your role as a data engineer responsible for maintaining a Delta Live Table (DLT) pipeline, you encounter an error during the pipeline's execution. You have access to the events log and need to identify the specific notebook that caused the error to facilitate troubleshooting. Considering the need for efficiency and accuracy in a production environment, which SQL query would you use to retrieve the notebook name and error message from the events log, ensuring the results are sorted to easily identify the most recent error? Choose the best option.
In your role as a data engineer responsible for maintaining a Delta Live Table (DLT) pipeline, you encounter an error during the pipeline's execution. You have access to the events log and need to identify the specific notebook that caused the error to facilitate troubleshooting. Considering the need for efficiency and accuracy in a production environment, which SQL query would you use to retrieve the notebook name and error message from the events log, ensuring the results are sorted to easily identify the most recent error? Choose the best option.
Explanation:
The best SQL query for this scenario must not only filter for entries with non-null error messages but also provide the notebook name and error message for effective troubleshooting. Sorting the results by timestamp in descending order is crucial to quickly identify the most recent error, which is often the most relevant for immediate action. Therefore, the correct query is: SELECT notebook_name, error_message FROM events_log WHERE error_message IS NOT NULL ORDER BY timestamp DESC;