Databricks Certified Associate Developer for Apache Spark Exam Practice Questions (P. 4)

Full Access (207 questions)
Six months of Premium Access
Access to one million comments
Seamless ChatGPT Integration

Ability to download PDF files
Anki Flashcard files for revision
No Captcha & No AdSense
Advanced Exam Configuration

Get Contributor Access

Question #16

Which of the following operations can be used to create a new DataFrame that has 12 partitions from an original DataFrame df that has 8 partitions?

A
df.repartition(12)
Most Voted
B
df.cache()
C
df.partitionBy(1.5)
D
df.coalesce(12)
E
df.partitionBy(12)

Correct Answer:
A

GPT-4o - Answer

The repartition() function in Spark, highlighted in option A, is the right call when you're looking to adjust the number of partitions in a DataFrame, and it can both increase and decrease this number. Specifically, when you use df.repartition(12), you're instructing Spark to redistribute the data across 12 new partitions, regardless of the original count, which is perfect when the frame initially has 8 partitions. This makes it a powerful tool for managing how data is divided and handled within Spark, leading to potentially improved performance in distributed environments.

Show Answer

Hi! Do you need help with this question ?

- Why isn't the A the right answer?

- Traducir la pregunta al español

Contributor get free access to an augmented ChatGPT 4 trained with the latest IT Questions.

content_copy

send

light_mode delete

Question #17

Which of the following object types cannot be contained within a column of a Spark DataFrame?

A
DataFrame
Most Voted
B
String
C
Array
D
null
E
Vector

Correct Answer:
A

Show Answer

Hi! Do you need help with this question ?

- Why isn't the A the right answer?

- Traducir la pregunta al español

Contributor get free access to an augmented ChatGPT 4 trained with the latest IT Questions.

content_copy

send

light_mode delete

Question #18

Which of the following operations can be used to create a DataFrame with a subset of columns from DataFrame storesDF that are specified by name?

A
storesDF.subset()
B
storesDF.select()
Most Voted
C
storesDF.selectColumn()
D
storesDF.filter()
E
storesDF.drop()

Correct Answer:
B

Show Answer

Hi! Do you need help with this question ?

- Why isn't the A the right answer?

- Traducir la pregunta al español

Contributor get free access to an augmented ChatGPT 4 trained with the latest IT Questions.

content_copy

send

light_mode delete

Question #19

The code block shown below contains an error. The code block is intended to return a DataFrame containing all columns from DataFrame storesDF except for column sqft and column customerSatisfaction. Identify the error.
Code block:
storesDF.drop(sqft, customerSatisfaction)

A
The drop() operation only works if one column name is called at a time – there should be two calls in succession like storesDF.drop("sqft").drop("customerSatisfaction").
B
The drop() operation only works if column names are wrapped inside the col() function like storesDF.drop(col(sqft), col(customerSatisfaction)).
C
There is no drop() operation for storesDF.
D
The sqft and customerSatisfaction column names should be quoted like "sqft" and "customerSatisfaction".
Most Voted
E
The sqft and customerSatisfaction column names should be subset from the DataFrame storesDF like storesDF."sqft" and storesDF."customerSatisfaction".

Correct Answer:
D

Show Answer

Hi! Do you need help with this question ?

- Why isn't the A the right answer?

- Traducir la pregunta al español

Contributor get free access to an augmented ChatGPT 4 trained with the latest IT Questions.

content_copy

send

light_mode delete

Question #20

Which of the following code blocks returns a DataFrame containing only the rows from DataFrame storesDF where the value in column sqft is less than or equal to 25,000?

A
storesDF.filter("sqft" <= 25000)
B
storesDF.filter(sqft > 25000)
C
storesDF.where(storesDF[sqft] > 25000)
D
storesDF.where(sqft > 25000)
E
storesDF.filter(col("sqft") <= 25000)
Most Voted

Correct Answer:
E

Show Answer

Hi! Do you need help with this question ?

- Why isn't the A the right answer?

- Traducir la pregunta al español

Contributor get free access to an augmented ChatGPT 4 trained with the latest IT Questions.

content_copy

send

light_mode delete

Previous Questions Next Questions

All Pages

Databricks Certified Associate Developer for Apache Spark Exam Practice Questions (P. 4)

Get Contributor Access

Download Demo PDF

Question #16

Question #17

Question #18

Question #19

Question #20

Best prices & offers

Latest Question

Expert Verified

Instant Download

High Success Rate

Follow Us

Databricks Certified Associate Developer for Apache Spark Exam Practice Questions (P. 4)

Get Contributor Access

Download Demo PDF

Question #16

Question #17

Question #18

Question #19

Question #20