Certified Associate Developer for Apache Spark Logo
Databricks Logo

Certified Associate Developer for Apache Spark Exam Questions

570

Total Questions

SEP
2025

Last Updated

1st

1st Try Guaranteed

Expert Verified

Experts Verified

Question 11 Single Choice

Which of the following statements about RDDs is incorrect?

Question 12 Single Choice

Which of the elements that are labeled with a circle and a number contain an error or are misrepresented?

Question 13 Single Choice

Which of the following describes characteristics of the Spark UI?

Question 14 Single Choice

Which of the following is a viable way to improve Spark's performance when dealing with large amounts of data, given that there is only a single application running on the cluster?

Question 15 Single Choice

Which of the following describes a shuffle?

Question 16 Single Choice

Which of the following describes Spark's Adaptive Query Execution?

Question 17 Single Choice

The code block displayed below contains an error. The code block is intended to join DataFrame itemsDf with the larger DataFrame transactionsDf on column itemId. Find the error.

Code block:

transactionsDf.join(itemsDf, "itemId", how="broadcast")

Question 18 Single Choice

Which of the following code blocks efficiently converts DataFrame transactionsDf from 12 into 24 partitions?

Question 19 Single Choice

Which of the following code blocks removes all rows in the 6-column DataFrame transactionsDf that have missing data in at least 3 columns?

Question 20 Single Choice

The code block displayed below contains an error. The code block should create DataFrame itemsAttributesDf which has columns itemId and attribute and lists every attribute from the attributes column in DataFrame itemsDf next to the itemId of the respective row in itemsDf. Find the error.

A sample of DataFrame itemsDf is below.

Code block:

itemsAttributesDf = itemsDf.explode("attributes").alias("attribute").select("attribute", "itemId")

Page: 2 / 57