Databricks Certified Data Engineer - Professional

How should you model your lakehouse architecture to efficiently handle a high-volume, write-heavy IoT workload in which millions of devices report every minute, while still supporting near-real-time queries?

Store raw data in a NoSQL database for write efficiency, and periodically ETL the processed data into the lakehouse for analytical queries.

12.0%

Implement a sharded approach, creating separate tables for subsets of devices, and use a metastore to track shards for querying.

10.2%

Apply a micro-batching technique that combines streaming ingestion with periodic optimization (compaction and indexing) of the stored data for analysis.

28.7%
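
To make the micro-batching option more concrete, here is a minimal, purely illustrative sketch in plain Python. It is not Databricks or Delta Lake code: `Table`, `ingest`, `batch_size`, and `compact_every` are hypothetical names, and the "files" are just in-memory lists. The assumption is that each micro-batch commit writes one small file, and that a periodic compaction step rewrites those small files into fewer, larger ones sorted by device. On Databricks, the corresponding pieces would typically be a Structured Streaming write into a Delta table plus a scheduled `OPTIMIZE`.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    """Toy stand-in for a lakehouse table: a list of 'files', each a list of rows."""
    files: list = field(default_factory=list)

    def append_batch(self, rows):
        # Each micro-batch commit writes one new (small) file.
        if rows:
            self.files.append(list(rows))

    def compact(self, target_rows):
        # Rewrite all files into chunks of at most target_rows rows,
        # sorted by device_id so rows for one device sit close together.
        all_rows = sorted(
            (row for f in self.files for row in f),
            key=lambda r: (r["device_id"], r["ts"]),
        )
        self.files = [
            all_rows[i:i + target_rows]
            for i in range(0, len(all_rows), target_rows)
        ]

    def scan(self, device_id):
        # Readers can query at any time, between batches or compactions.
        return [r for f in self.files for r in f if r["device_id"] == device_id]


def ingest(events, table, batch_size, compact_every):
    """Group events into micro-batches; compact after every compact_every batches."""
    buffer, batches = [], 0
    for event in events:
        buffer.append(event)
        if len(buffer) >= batch_size:
            table.append_batch(buffer)
            buffer = []
            batches += 1
            if batches % compact_every == 0:
                table.compact(target_rows=batch_size * compact_every)
    table.append_batch(buffer)  # flush the final partial batch, if any


# Hypothetical workload: 5 devices, 100 readings in arrival order.
events = [{"device_id": i % 5, "ts": i, "value": i * 0.1} for i in range(100)]
table = Table()
ingest(events, table, batch_size=10, compact_every=4)
print(len(table.files), "files,", sum(len(f) for f in table.files), "rows")
```

The design point the sketch tries to show is the trade-off in the option text: small, frequent commits keep data queryable shortly after arrival, while the periodic rewrite limits the number of small files that a reader has to scan.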