
AWS Certified Data Engineer - Associate
Get started today
Ultimate access to all questions.
You are designing a data pipeline for a healthcare organization that needs to process and analyze patient data from various sources, including electronic health records (EHR) and medical imaging. The pipeline should be able to handle both structured and unstructured data. Which AWS services would you use to create this pipeline, and how would you configure them to meet the requirements?
You are designing a data pipeline for a healthcare organization that needs to process and analyze patient data from various sources, including electronic health records (EHR) and medical imaging. The pipeline should be able to handle both structured and unstructured data. Which AWS services would you use to create this pipeline, and how would you configure them to meet the requirements?
Explanation:
Option A is the most suitable choice for the given scenario. Amazon S3 can be used to store both structured and unstructured patient data from various sources. AWS Glue can process the data, including extracting information from medical imaging, and transform it into a structured format. Amazon Redshift can be used as a data warehousing solution to store and analyze the processed data. By configuring Glue to process both structured and unstructured data and store the processed data in Redshift, the pipeline can handle the requirements of the healthcare organization.