
Empower Clients Through IT
IT EXPERT SYSTEM, INC
IT Training, Staffing and IT Services Provider
Databricks
This Databricks course equips learners with the skills to build modern data engineering, analytics, and AI solutions using the Databricks Lakehouse Platform. Participants will learn to manage clusters, perform ETL/ELT using PySpark, work with Delta Lake for reliable data storage, build SQL dashboards, create real-time streaming pipelines, and develop machine learning models using MLflow and Databricks ML. The course also covers governance with Unity Catalog, end-to-end pipeline development with Workflows and DLT, and prepares students for Databricks certification exams.
Course Content
Module 1: Introduction to Databricks
-
What is Databricks?
-
Why Databricks? Key features & benefits
-
Databricks vs Hadoop vs Snowflake vs AWS EMR
-
Understanding the Lakehouse Architecture
-
Workspaces, users, groups, and notebooks
-
Databricks pricing & cluster types
Module 2: Databricks Workspace & Environment Setup
-
Navigating the Databricks UI
-
Creating and managing notebooks (Python, SQL, Scala)
-
Repos & Git integration
-
Databricks File System (DBFS)
-
Accessing data using Mount Points
-
Working with Databricks Utilities (DBUtils)
Module 3: Databricks Clusters & Compute
-
Types of clusters (Standard, High Concurrency, Single Node)
-
Cluster setup, autoscaling, spot instances
-
Jobs and Job Clusters
-
Pools for cost optimization
-
Cluster policies and security
Module 4: Data Engineering with Databricks
-
ETL vs ELT on Databricks
-
Reading/Writing data from ADLS/S3/GCS
-
Transformations with PySpark
-
Databricks Delta: Delta Lake essentials
-
Delta Tables, ACID Transactions, Time Travel, Vacuum
-
Auto Loader for streaming data ingestion
-
Data pipelines with Workflows
Module 5: Databricks SQL & BI Analytics
-
SQL Warehouses (formerly SQL Endpoints)
-
Writing analytical queries
-
Creating dashboards
-
Materialized views & querying Delta Tables
-
Performance optimization & query caching
-
Connecting Power BI/Tableau to Databricks
Module 6: Delta Lake Deep Dive
-
Delta Architecture
-
Delta schema enforcement & evolution
-
Delta Live Tables (DLT)
-
Medallion Architecture (Bronze, Silver, Gold)
-
Optimize & Z-Order commands
-
Handling slowly changing dimensions (SCD Type 1/2)
Module 7: Machine Learning & AI with Databricks
-
Databricks ML runtime
-
Feature engineering with Feature Store
-
MLflow for experiment tracking
-
AutoML in Databricks
-
Building ML pipelines
-
Model deployment & serving
-
Integrating with external ML libraries (TensorFlow, PyTorch, Scikit-Learn)


Module 8: Databricks GenAI & LLMs
-
What is Databricks MosaicML?
-
Training and fine-tuning LLMs
-
Databricks Foundation Models (Dolly 2.0, DBRX)
-
Vector Search & RAG pipelines
-
Building AI apps with Databricks
-
Integrating LLMs into data workflows
Module 9: Streaming with Databricks
-
Structured Streaming Overview
-
Auto Loader streaming ingestion
-
Event Hub / Kafka Integration
-
Real-time dashboards
-
Streaming Delta tables
-
Handling late data & watermarks
Module 10: Databricks Governance & Security
-
Unity Catalog Overview
-
Catalog → Schema → Tables Permission Model
-
Row-Level & Column-Level Security
-
Token Management & SCIM
-
Data lineage & audit logs
-
Secure data sharing (Delta Sharing)
Module 11: Databricks DevOps & CI/CD
-
GitHub / Azure DevOps Integration
-
CICD for notebooks & repos
-
Databricks Asset Bundles
-
Managing environments: Dev → Test → Prod
-
Infrastructure as Code (Terraform + Databricks)
-
Monitoring Jobs, Alerts, Clusters
Module 11: Real-World Projects
Staffing Support
-
Resume Preparation
-
Mock Interview Preparation
-
Phone Interview Preparation
-
Face to Face Interview Preparation
-
Project/Technology Preparation
-
Internship with internal project work
-
Externship with client project work
Our Salient Features:
-
Hands-on Labs and Homework
-
Group discussion and Case Study
-
Course Project work
-
Regular Quiz / Exam
-
Regular support beyond the classroom
-
Students can re-take the class at no cost
-
Dedicated conf. rooms for group project work
-
Live streaming for the remote students
-
Video recording capability to catch up the missed class
_edited_edited_edited.jpg)


