Following on from my previous post I wanted to cover off some more key topics that can really help your understanding of Spark and diving in to the Databricks Certified Associate Developer for Apache Spark 3.0 exam. For more information on general assessment tips, great practice exams to take and other core topics, please seeContinue reading “Tips for the Databricks Certified Associate Developer for Apache Spark 3.0 – Python – Pt.2”
Tag Archives: Databricks
Tips for the Databricks Certified Associate Developer for Apache Spark 3.0 – Python – Pt.1
After recently diving in to (and passing!) the Associate Developer for Apache Spark 3.0 exam certification from Databricks, I thought it would be useful to go over some quick points to remember and some potential ‘gotcha’ topics for anyone considering the challenge. The majority of the exam (72% in fact) features the use of theContinue reading “Tips for the Databricks Certified Associate Developer for Apache Spark 3.0 – Python – Pt.1”
From Warehouse to Lakehouse Pt.1 – Slowly Changing Dimensions (SCD) with Delta
SCD Type 1 in SQL and Python Introduction With the move to cloud based Data Lake platforms there has often been criticism from the more traditional Data Warehousing community. A Data Lake, offering cheap, almost endlessly scalable storage in the cloud is hugely appealing to a platform administrator however over the number of years thatContinue reading “From Warehouse to Lakehouse Pt.1 – Slowly Changing Dimensions (SCD) with Delta”