Databricks Certified Data Engineer Associate Exam Practice Questions (P. 3)

Full Access (223 questions)
Six months of Premium Access
Access to one million comments
Seamless ChatGPT Integration

Ability to download PDF files
Anki Flashcard files for revision
No Captcha & No AdSense
Advanced Exam Configuration

Get Contributor Access

Question #11

A data engineer has left the organization. The data team needs to transfer ownership of the data engineer’s Delta tables to a new data engineer. The new data engineer is the lead engineer on the data team.
Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?

A
Databricks account representative
B
This transfer is not possible
C
Workspace administrator
Most Voted
D
New lead data engineer
E
Original data engineer

Correct Answer:
C

GPT-4o - Answer

The correct response seems to be C, the Workspace Administrator rather than D, the New Lead Data Engineer. Workspace administrators customarily hold the authority necessary for changing ownership of resources like Delta tables in Data Explorer. This task normally falls under administrative privileges because they manage broader access controls and data governance frameworks, especially when the original owner is no longer available to make such changes. Always be sure to consult specific workspace permissions and configurations to verify the appropriate procedures.

Show Answer

send

light_mode delete

Question #12

A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL.
Which of the following commands could the data engineering team use to access sales in PySpark?

A
SELECT * FROM sales
B
There is no way to share data between PySpark and SQL.
C
spark.sql("sales")D. spark.delta.table("sales")
D
spark.table("sales")

Correct Answer: E ?️

GPT-4o - Answer

The correct function for the data engineering team to use in accessing the "sales" Delta table within a PySpark environment would indeed be spark.table("sales"). This function allows them to access any table that's registered in the Spark Session catalog, facilitating the execution of Python-based operations on the Delta table. It’s a straightforward and common method in PySpark to engage with tables without needing to specify the data format, making it an apt choice for seamlessly transitioning between SQL and PySpark contexts.

Show Answer

send

light_mode delete

Question #13

Which of the following commands will return the location of database customer360?

A
DESCRIBE LOCATION customer360;
B
DROP DATABASE customer360;
C
DESCRIBE DATABASE customer360;
Most Voted
D
ALTER DATABASE customer360 SET DBPROPERTIES ('location' = '/user'};
E
USE DATABASE customer360;

Correct Answer:
C

GPT-4o - Answer

For the retrieval of the location of the database "customer360" especially in environments like Hive or Databricks, the command "DESCRIBE DATABASE customer360" is indeed the correct choice. It succinctly gives you comprehensive details about the database, of which the location is a key part. Utilizing this precise command effectively fetches all necessary metadata, ensuring clarity and ease in database management.

Show Answer

send

light_mode delete

Question #14

A data engineer wants to create a new table containing the names of customers that live in France.
They have written the following command:

A senior data engineer mentions that it is organization policy to include a table property indicating that the new table includes personally identifiable information (PII).
Which of the following lines of code fills in the above blank to successfully complete the task?

A
There is no way to indicate whether a table contains PII.
B
"COMMENT PII"
C
TBLPROPERTIES PII
D
COMMENT "Contains PII"
Most Voted
E
PII

Correct Answer:
D

Show Answer

send

light_mode delete

Question #15

Which of the following benefits is provided by the array functions from Spark SQL?

A
An ability to work with data in a variety of types at once
B
An ability to work with data within certain partitions and windows
C
An ability to work with time-related data in specified intervals
D
An ability to work with complex, nested data ingested from JSON files
Most Voted
E
An ability to work with an array of tables for procedural automation

Correct Answer:
D

GPT-4o - Answer

Array functions in Spark SQL are particularly valuable for handling complex, nested data structures, frequently found in JSON files. These functions streamline the processing of nested arrays and structures, enabling easier manipulation, querying, and extraction of specific elements. This empowers data engineers to efficiently handle and analyze data that comes in layered or hierarchical formats. If you're dealing with such complex data types, leveraging these array functions can significantly simplify your workflows.

Show Answer

send

light_mode delete

Previous Questions Next Questions

All Pages

Databricks Certified Data Engineer Associate Exam Practice Questions (P. 3)

Get Contributor Access

Download Demo PDF

Question #11

Question #12

Question #13

Question #14

Question #15

Best prices & offers

Latest Question

Expert Verified

Instant Download

High Success Rate

Follow Us

Databricks Certified Data Engineer Associate Exam Practice Questions (P. 3)

Get Contributor Access

Download Demo PDF

Question #11

Question #12

Question #13

Question #14

Question #15