A data architect has designed a system where two Structured Streaming jobs will concurrently write to the same bronze Delta table. Each job consumes data from a different Apache Kafka topic but writes records with identical schemas. To simplify the directory structure, a data engineer proposes using a shared checkpoint directory for both streams, with the following layout:
/bronze
    _checkpoint
    _delta_log
    year_week=2020_01
    year_week=2020_02
Is this checkpoint directory structure valid for the given scenario, and why?
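As background for reasoning about this question: Structured Streaming records each query's source offsets and commit progress in the location given by the `checkpointLocation` option, so a checkpoint belongs to a single query. The sketch below shows one way the two jobs could each get their own checkpoint directory while still appending to the same Delta table. It is an illustration under stated assumptions, not the architect's actual code: the topic names, the `_checkpoints/<topic>` layout, and the helpers `build_stream_configs` and `start_bronze_stream` are hypothetical, and partitioning by `year_week` is left out for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StreamConfig:
    topic: str        # Kafka topic consumed by this query (hypothetical name)
    checkpoint: str   # checkpoint directory used only by this query


def build_stream_configs(table_path, topics):
    """Assign each topic its own checkpoint directory under the table path."""
    return [
        StreamConfig(topic=t, checkpoint=f"{table_path}/_checkpoints/{t}")
        for t in topics
    ]


def start_bronze_stream(spark, cfg, table_path, bootstrap_servers):
    """Start one Kafka-to-Delta query.

    Assumes `spark` is a SparkSession that has the Kafka source and
    Delta Lake available; it is not called in this sketch.
    """
    return (
        spark.readStream.format("kafka")
        .option("kafka.bootstrap.servers", bootstrap_servers)
        .option("subscribe", cfg.topic)
        .load()
        .writeStream.format("delta")
        .outputMode("append")
        # Each query points at its own checkpoint, never a shared one.
        .option("checkpointLocation", cfg.checkpoint)
        .start(table_path)
    )


# Hypothetical topic names; both queries target the same /bronze table.
configs = build_stream_configs("/bronze", ["topic_a", "topic_b"])
```

With this layout both queries write to `/bronze`, and Delta Lake's transaction log coordinates the concurrent appends, while each query keeps its offsets in a separate directory.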