top of page

Databricks

 

This Databricks course equips learners with the skills to build modern data engineering, analytics, and AI solutions using the Databricks Lakehouse Platform. Participants will learn to manage clusters, perform ETL/ELT using PySpark, work with Delta Lake for reliable data storage, build SQL dashboards, create real-time streaming pipelines, and develop machine learning models using MLflow and Databricks ML. The course also covers governance with Unity Catalog, end-to-end pipeline development with Workflows and DLT, and prepares students for Databricks certification exams.

Course Content

Module 1: Introduction to Databricks

  1. What is Databricks?

  2. Why Databricks? Key features & benefits

  3. Databricks vs Hadoop vs Snowflake vs AWS EMR

  4. Understanding the Lakehouse Architecture

  5. Workspaces, users, groups, and notebooks

  6. Databricks pricing & cluster types

Module 2: Databricks Workspace & Environment Setup

  1. Navigating the Databricks UI

  2. Creating and managing notebooks (Python, SQL, Scala)

  3. Repos & Git integration

  4. Databricks File System (DBFS)

  5. Accessing data using Mount Points

  6. Working with Databricks Utilities (DBUtils)

Module 3: Databricks Clusters & Compute

  1. Types of clusters (Standard, High Concurrency, Single Node)

  2. Cluster setup, autoscaling, spot instances

  3. Jobs and Job Clusters

  4. Pools for cost optimization

  5. Cluster policies and security

Module 4: Data Engineering with Databricks

  1. ETL vs ELT on Databricks

  2. Reading/Writing data from ADLS/S3/GCS

  3. Transformations with PySpark

  4. Databricks Delta: Delta Lake essentials

  5. Delta Tables, ACID Transactions, Time Travel, Vacuum

  6. Auto Loader for streaming data ingestion

  7. Data pipelines with Workflows

Module 5: Databricks SQL & BI Analytics

  1. SQL Warehouses (formerly SQL Endpoints)

  2. Writing analytical queries

  3. Creating dashboards

  4. Materialized views & querying Delta Tables

  5. Performance optimization & query caching

  6. Connecting Power BI/Tableau to Databricks

Module 6: Delta Lake Deep Dive

  1. Delta Architecture

  2. Delta schema enforcement & evolution

  3. Delta Live Tables (DLT)

  4. Medallion Architecture (Bronze, Silver, Gold)

  5. Optimize & Z-Order commands

  6. Handling slowly changing dimensions (SCD Type 1/2)

Module 7: Machine Learning & AI with Databricks

  1. Databricks ML runtime

  2. Feature engineering with Feature Store

  3. MLflow for experiment tracking

  4. AutoML in Databricks

  5. Building ML pipelines

  6. Model deployment & serving

  7. Integrating with external ML libraries (TensorFlow, PyTorch, Scikit-Learn)

Module 8: Databricks GenAI & LLMs

  1. What is Databricks MosaicML?

  2. Training and fine-tuning LLMs

  3. Databricks Foundation Models (Dolly 2.0, DBRX)

  4. Vector Search & RAG pipelines

  5. Building AI apps with Databricks

  6. Integrating LLMs into data workflows

Module 9: Streaming with Databricks

  1. Structured Streaming Overview

  2. Auto Loader streaming ingestion

  3. Event Hub / Kafka Integration

  4. Real-time dashboards

  5. Streaming Delta tables

  6. Handling late data & watermarks

Module 10: Databricks Governance & Security

  1. Unity Catalog Overview

  2. Catalog → Schema → Tables Permission Model

  3. Row-Level & Column-Level Security

  4. Token Management & SCIM

  5. Data lineage & audit logs

  6. Secure data sharing (Delta Sharing)

Module 11: Databricks DevOps & CI/CD

  • GitHub / Azure DevOps Integration

  • CICD for notebooks & repos

  • Databricks Asset Bundles

  • Managing environments: Dev → Test → Prod

  • Infrastructure as Code (Terraform + Databricks)

  • Monitoring Jobs, Alerts, Clusters

Module 11: Real-World Projects

Staffing Support​
  • Resume Preparation

  • Mock Interview Preparation

  • Phone Interview Preparation

  • Face to Face Interview Preparation

  • Project/Technology Preparation

  • Internship with internal project work

  • Externship with client project work

Our Salient Features:
  • Hands-on Labs and Homework

  • Group discussion and Case Study

  • Course Project work

  • Regular Quiz / Exam

  • Regular support beyond the classroom

  • Students can re-take the class at no cost

  • Dedicated conf. rooms for group project work

  • Live streaming for the remote students

  • Video recording capability to catch up the missed class

Student Portal

Training / Service Center :

951 N. Plum Grove Rd.

Suite A, C
Schaumburg, IL, 60173

Ph: 847 350 9034 x option 1

Email: info@itexps.com

Service Center :

1560 Wall Street,

Suite #111,

Naperville, IL 60563 

Ph: 847 350 9034 x option 2

Email: info@itexps.com

IT Expert System, Inc is approved to operate by the Private Business and Vocational Schools Division of the Illinois Board of Higher Education.

 IBHE Mandatory Disclosure Reporting

IT Expert System, Inc is regulated by: Indiana Department of Workforce Development, Office for Career and Technical School

10 N Senate Avenue, Suite SE 308, Indianapolis, IN 46204

OCTS@dwd.in.gov, http://www.in.gov/dwd/2731.htm

‘PMP’ and 'CAPM' are registered marks of the Project Management Institute, Inc.

IT Expert provides staffing, placement, consulting, proctoring, and internship services separately, and these offerings are not included in the ACCET-accredited IT Expert System training programs.

bottom of page