AWS Certified Data Engineer - Associate - (DEA-C01) Logo
Amazon Logo

AWS Certified Data Engineer - Associate - (DEA-C01) Exam Questions

811

Total Questions

SEP
2025

Last Updated

1st

1st Try Guaranteed

Expert Verified

Experts Verified

Question 1 Single Choice

The data engineering team at a company wants to analyze Amazon S3 storage access patterns to decide when to transition the right data to the right storage class.

Which of the following represents a correct option regarding the capabilities of Amazon S3 Analytics storage class analysis?

Question 2 Multiple Choice

An e-commerce company runs its workloads on Amazon EMR clusters. The data engineering team at the company manually installs third-party libraries on the newly launched clusters by logging onto the master nodes. The team wants to develop an automated solution that will replace this human intervention.

Which of the following options would you recommend for the given requirement? (Select two)

Question 3 Single Choice

An application uses Kinesis Data Streams to process real-time data for business analytics. Monitoring this incoming and outgoing data stream from the Kinesis Data Streams is important for the performance of the system as well as the downstream applications. For a read-intensive requirement, the age for the last record in the data stream for all the GetRecords requests need to be tracked.

Which stream-level metric will help address this requirement?

Question 4 Single Choice

A financial analytics company wants to gather insights from personal finance data stored on Amazon S3 in the Microsoft Excel workbook format.

Which of the following represents a serverless solution to interactively discover, clean and transform this raw data for performing this analysis?

Question 5 Multiple Choice

The web development team at an IT company has about 200 TB of web-log data that is stored in an Amazon S3 bucket as raw text. Each log file is identified by a key of the type year-month-day_log_HHmmss.txt where HHmmss denotes the time the log file was created. The data engineering team has created an Amazon Athena table that links to the given S3 bucket. The team executes several queries every hour against a subset of the table's columns. The company wants a Hive-metastore compatible solution that costs less and requires less maintenance to support the ongoing analytics on this log data.

As an AWS Certified Data Engineer Associate, which of the following solutions would you combine to address these requirements? (Select three)

Question 6 Single Choice

A data analytics job requires data from multiple sources like Amazon DynamoDB, Amazon RDS, and Amazon Redshift. The job is run on Amazon Athena.

Which of the following is the MOST cost-effective way to join data from these sources?

Question 7 Single Choice

A logistics company operates a near real-time inventory tracking system for vehicle depots across multiple geographic regions. Third-party vendors upload multiple logs of vehicle arrivals and departures in the form of small compressed files (less than 10 KB) to a central Amazon S3 bucket. The company needs to immediately process new uploads to keep a dashboard up to date. The dashboard must be refreshed near real-time to reflect the latest vehicle inventory across regions. A data engineer is tasked with designing a cost-effective, low-latency, and scalable solution that automates the processing and transformation of the uploaded data, enables ad hoc querying for business analysts, and supports visual reporting through dashboards.

Which solution will best meet these requirements in the most cost-effective and scalable manner?

Question 8 Multiple Choice

The data engineering team at a social media company wants to use Amazon CloudWatch alarms to automatically recover Amazon EC2 instances if they become impaired. The team has hired you to provide subject matter expertise.

Which of the following statements would you identify as CORRECT regarding this automatic recovery process? (Select two)

Question 9 Single Choice

A company regularly extracts about 2 TB of data daily from various data sources - including MySQL, MSSQL Server, Oracle, Vertica, and Teradata Vantage. Some of these sources feature undefined or frequently changing data schemas. A data engineer is tasked with implementing a solution that can automatically detect the schema of these data sources and perform data extraction, transformation, and loading to an Amazon S3 bucket.

What solution would meet these needs while minimizing operational overhead?

Question 10 Single Choice

A financial services company stores confidential data on an Amazon Simple Storage Service (S3) bucket. The compliance guidelines require that files be stored with server-side encryption. The encryption used must be Advanced Encryption Standard (AES-256) and the company does not want to manage the encryption keys.

What do you recommend?

Page: 1 / 82