Databricks Certified Data Engineer - Professional

Get started today

Ultimate access to all questions.

In the context of designing a multiplex bronze table for productionalizing streaming workloads on Azure Databricks, consider the following scenario: Your organization is ingesting streaming data from multiple sources, each with different formats and schemas. The goal is to ensure efficient data ingestion, processing, and maintain data consistency and integrity while adhering to cost constraints and scalability requirements. Which of the following approaches is the BEST to avoid common pitfalls in this scenario? (Choose one option)

Simulated

Ingest all streaming data into a single bronze table without considering the source or data format, relying on post-ingestion transformations to handle discrepancies.

6.1%

Create separate bronze tables for each streaming data source, each with a unique schema, and manage them independently to ensure data isolation.

Comments

Loading comments...

Design a single bronze table with a unified schema capable of accommodating all streaming data sources and formats, leveraging Delta Lake's capabilities for efficient data processing and consistency.

49.3%

Implement multiple bronze tables, each tailored to a specific data source or format, and use a complex orchestration layer to manage data flow and transformations between them.

21.1%