Perform data engineering with Azure Synapse Apache Spark Pools

Learning Path
3 Modules

Intermediate

Data Engineer

Azure Synapse Analytics

At a glance

Level: Intermediate
Product: Azure Synapse Analytics
Role: Data Engineer
Subject: Data engineering

Apache Spark is a highly scalable distributed processing engine for big data analytics and transformation. In Azure Synapse Analytics, you run Spark workloads on Spark pools.

Prerequisites

Before starting this learning path, you should be familiar with Azure Synapse Analytics. Consider completing the Introduction to Azure Synapse Analytics module first.

Modules in this learning path

Analyze data with Apache Spark in Azure Synapse Analytics

Apache Spark is a core technology for large-scale data analytics. Learn how to use Spark in Azure Synapse Analytics to analyze and visualize data in a data lake.

Transform data with Spark in Azure Synapse Analytics

Data engineers commonly need to transform large volumes of data. Apache Spark pools in Azure Synapse Analytics provide a distributed processing platform that they can use to accomplish this goal.

Use Delta Lake in Azure Synapse Analytics

Delta Lake is an open-source storage layer for Spark that adds relational table capabilities to files in a data lake. You can use it to implement a data lakehouse architecture in Azure Synapse Analytics.