The Databricks Certified Data Engineer Associate exam validates your ability to use the Databricks platform to complete introductory data engineering tasks. This includes an understanding of the Databricks Intelligence Platform, ELT with Apache Spark SQL and Python, incremental data processing, pipeline deployment, and data governance using Unity Catalog.
The study material for each section is in the subpages below. Open a section and work through the topics inside it. Sections 2 and 3 together account for 61% of the exam (30% + 31%), so spend the most time there.
| Section | Topic | Exam Weight |
|---|---|---|
| 1 | 🏗️ Databricks Intelligence Platform | 10% |
| 2 | 🔄 Development and Ingestion | 30% |
| 3 | ⚙️ Data Processing & Transformations | 31% |
| 4 | 🚀 Productionizing Data Pipelines | 18% |
| 5 | 🛡️ Data Governance & Quality | 11% |
<aside> 💡 The Associate exam is primarily SQL with some Python. Make sure you are comfortable reading both before exam day.
</aside>
Section 1: Databricks Intelligence Platform