
Answer-first summary for fast verification
Answer: Utilize HDFS encryption zones for data stored by Spark, ensuring data is encrypted at the storage level.
Utilizing HDFS encryption zones (option C) is the most suitable strategy for ensuring data is encrypted when persisted in Spark DataFrames. This approach provides transparent encryption of data at the storage level, ensuring data remains encrypted on disk. It offers a robust and efficient solution for encryption-at-rest, without the need for manual encryption/decryption operations or relying on Spark‘s internal encryption mechanisms. Enabling Spark‘s internal encryption mechanism (option B) may not secure data at rest, only in transit. Manual encryption (option A) is cumbersome and error-prone, while relying on Spark‘s automatic encryption (option D) may not meet the required security standards for encryption-at-rest.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
To enhance data security by implementing encryption-at-rest for data stored in Spark DataFrames, which strategy ensures data is encrypted when persisted?
A
Encrypt data manually before storing in DataFrames and decrypt upon reading, using UDFs for encryption/decryption operations.
B
Enable Spark‘s internal encryption mechanism by configuring spark.io.encryption.enabled to true.
C
Utilize HDFS encryption zones for data stored by Spark, ensuring data is encrypted at the storage level.
D
Rely on Spark‘s automatic encryption for persisted DataFrames, requiring no additional configuration.
No comments yet.