
Answer-first summary for fast verification
Answer: C. Rowkey: device_id; Column data: date, data_point
The correct schema is **C. Rowkey: device_id; Column data: date, data_point**. This design fits the stated query pattern because:

- **Rowkey: device_id**: All data for a device lives under one row key, so a lookup by device is a single-row read. This matches the first half of the most common query ("all data for a given device").
- **Column data: date, data_point**: The date stored alongside each data point lets a read be narrowed to a specific day or date range within that device's data. The data point is the record received every 15 minutes.

The other options fit less well. A, B, and E lead the row key with the date, so records arriving at the same time land in the same part of the key space, which Bigtable's schema guidance warns against because it concentrates writes. D and E key on data_point, a value the common query never filters on.
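As a rough illustration of the read path this schema implies, the sketch below models a table as a plain Python dictionary rather than calling the Bigtable client. The names `table`, `write_record`, and `read_device_day`, the device IDs, and the `YYYY-MM-DDTHH:MM` qualifier format are all assumptions made for the example; they are not part of the question.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Hypothetical in-memory stand-in for a table:
# row key (device_id) -> {column qualifier (timestamp string) -> data_point}
table = defaultdict(dict)


def write_record(device_id, ts, data_point):
    """Store one reading in the device's row; the qualifier carries the date and time."""
    qualifier = ts.strftime("%Y-%m-%dT%H:%M")  # assumed qualifier format
    table[device_id][qualifier] = data_point


def read_device_day(device_id, day):
    """Look up one row by key, then keep only columns whose qualifier starts with `day`."""
    row = table.get(device_id, {})
    return {q: v for q, v in sorted(row.items()) if q.startswith(day)}


# Simulate two days of 15-minute readings (96 per day) for two made-up devices.
start = datetime(2024, 1, 1)
for device in ("device-001", "device-002"):
    for i in range(2 * 96):
        write_record(device, start + timedelta(minutes=15 * i), f"{device}-reading-{i}")

day_data = read_device_day("device-001", "2024-01-02")
print(len(day_data))  # 96
```

The lookup touches only the row for `device-001` and then filters on the date portion of the qualifier, which is the access pattern the explanation above describes.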
Author: LeetQuiz Editorial Team
MJTelco, a startup with innovative optical communications hardware patents, is scaling its proof-of-concept (PoC) project to support over 50,000 installations. They require a schema in Google Bigtable for historical analysis of the last 2 years of records, with each record sent every 15 minutes containing a unique device identifier and a data record. The most common query is for all data for a given device on a given day. Which schema should you use?
A. Rowkey: date#device_id; Column data: data_point
B. Rowkey: date; Column data: device_id, data_point
C. Rowkey: device_id; Column data: date, data_point
D. Rowkey: data_point; Column data: device_id, date
E. Rowkey: date#data_point; Column data: device_id
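To get a feel for the data volume behind the question, the figures in the prompt can be multiplied out. This is a back-of-the-envelope sketch: it treats "over 50,000 installations" as exactly 50,000 and assumes 365-day years, so the real totals would be somewhat higher.

```python
DEVICES = 50_000          # lower bound taken from "over 50,000 installations"
INTERVAL_MINUTES = 15     # one record per device every 15 minutes
DAYS = 2 * 365            # two years of history, assuming 365-day years

records_per_device_per_day = 24 * 60 // INTERVAL_MINUTES
records_per_device = records_per_device_per_day * DAYS
total_records = records_per_device * DEVICES

print(records_per_device_per_day)  # 96
print(records_per_device)          # 70080
print(total_records)               # 3504000000
```

Under these assumptions the common query returns 96 records, out of roughly 70,000 stored per device and about 3.5 billion overall.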