Use Apache Spark in Microsoft Fabric

Intermediate
Data Analyst
Data Engineer
Microsoft Fabric

Apache Spark is a core technology for large-scale data analytics. Microsoft Fabric provides support for Spark clusters, enabling you to analyze and process data in a Lakehouse at scale.

Learning objectives

In this module, you'll learn how to:

  • Configure Spark in a Microsoft Fabric workspace

  • Identify suitable scenarios for Spark notebooks and Spark jobs

  • Use Spark dataframes to analyze and transform data

  • Use Spark SQL to query data in tables and views

  • Visualize data in a Spark notebook

Prerequisites

Before starting this module, you should be familiar with the Microsoft Fabric interface and core concepts.