Implementing a Lakehouse with Microsoft Fabric (DP-601)
Course 8681
1 DAY COURSE

Course Outline

This course builds your foundational skills in data engineering on Microsoft Fabric, focusing on the lakehouse concept. You will explore Apache Spark for distributed data processing and learn techniques for data management, versioning, and reliability by working with Delta Lake tables. The course also covers data ingestion and orchestration using Dataflows Gen2 and Data Factory pipelines.

This course includes a combination of lectures and hands-on exercises that will prepare you to work with Lakehouses in Microsoft Fabric.

Implementing a Lakehouse with Microsoft Fabric (DP-601) Benefits

  • In this course, you will:

    • Discover how Microsoft Fabric can meet your enterprise's analytics needs in one platform.
    • Describe core features and capabilities of lakehouses in Microsoft Fabric.
    • Analyze and process data in a Lakehouse at scale.
    • Create advanced analytics solutions using the enhanced capabilities of delta tables.
    • Visually create multi-step data ingestion and transformation using Power Query Online with Dataflows (Gen2).
    • Create pipelines that orchestrate data ingestion and transformation tasks with Data Factory capabilities within Microsoft Fabric.
  • Training Prerequisites

    You should be familiar with basic data concepts and terminology.

Microsoft Fabric (DP-601) Training Outline

Module 1: Introduction to end-to-end analytics using Microsoft Fabric

In this module, you'll learn how to:

  • Describe end-to-end analytics in Microsoft Fabric

Module 2: Get started with lakehouses in Microsoft Fabric

In this module, you'll learn how to:

  • Describe core features and capabilities of lakehouses in Microsoft Fabric
  • Create a lakehouse
  • Ingest data into files and tables in a lakehouse
  • Query lakehouse tables with SQL

Module 3: Use Apache Spark in Microsoft Fabric

In this module, you'll learn how to:

  • Configure Spark in a Microsoft Fabric workspace
  • Identify suitable scenarios for Spark notebooks and Spark jobs
  • Use Spark dataframes to analyze and transform data
  • Use Spark SQL to query data in tables and views
  • Visualize data in a Spark notebook

Module 4: Work with Delta Lake tables in Microsoft Fabric

In this module, you'll learn how to:

  • Understand Delta Lake and delta tables in Microsoft Fabric
  • Create and manage delta tables using Spark
  • Use Spark to query and transform data in delta tables
  • Use delta tables with Spark structured streaming

Module 5: Ingest Data with Dataflows Gen2 in Microsoft Fabric

In this module, you'll learn how to:

  • Describe Dataflow (Gen2) capabilities in Microsoft Fabric
  • Create Dataflow (Gen2) solutions to ingest and transform data
  • Include a Dataflow (Gen2) in a pipeline

Module 6: Use Data Factory pipelines in Microsoft Fabric

In this module, you'll learn how to:

  • Describe pipeline capabilities in Microsoft Fabric
  • Use the Copy Data activity in a pipeline
  • Create pipelines based on predefined templates
  • Run and monitor pipelines
Note about the Certification Exam

When you register for the course, you will be prompted to choose Y/N to take the exam. Please select yes, as all HHS CISO employees are required to attempt the exam if one is offered for the course. Please be advised that if your course is funded by DIR, the Certification Organization has agreed to provide DIR with the pass/fail status of your exam. DIR will share this information only in an aggregated report to state leadership that reflects the total number of exam passes and fails. No individual student names will be included in any report.

DIR requires that you submit the request for your exam voucher within one month of the last day of your course. DIR requires that you take your exam within six months of the last day of your course.
