Microsoft Azure Databricks: Collaborative Apache Spark Analytics Platform
I was talking with some colleagues about an institution that wanted to understand how to integrate Azure's data science services into its curriculum for the new semester.
One of the suggestions we came up with was to use Microsoft Azure DSVMs, HDInsight clusters, Databricks, and notebooks.
I have written a lot in past blog posts about the Microsoft Data Science Virtual Machine (DSVM) and HDInsight, but one of the newer services we have in Azure is Azure Databricks.
A quick introduction to Azure Databricks
Azure Databricks is a fast, easy, and collaborative Apache Spark-based analytics platform optimized for Azure. Designed in collaboration with the founders of Apache Spark, Azure Databricks combines the best of Databricks and Azure to help customers accelerate innovation with one-click set up, streamlined workflows, and an interactive workspace that enables collaboration between data scientists, data engineers, and business analysts. As an Azure service, customers automatically benefit from the native integration with other Azure services such as Power BI, SQL Data Warehouse, and Cosmos DB, as well as from enterprise-grade Azure security, including Active Directory integration, compliance, and enterprise-grade SLAs.
See what's possible with Azure Databricks
- Watch how to get started with Apache Spark on Azure Databricks
- View critical use cases to solve your big data and AI challenges
- Read the three practical use cases for data scientists
Educational Materials and Tutorials for Databricks
The notebooks listed below are the core source for Azure Databricks and Spark training material, and they are available to help you learn best practices in Spark. Educators around the world, including Azure Databricks trainers, created this material to help users learn how to use Apache Spark.
- Introduction to Apache Spark
- Analyzing 1000 Genomes with Spark and Hail
- Business Data Analysis LA
- Analyzing Genomes with VariantSpark
Additional Resources and Material for Educators
We have a great set of Azure labs for students and educators to get you started: https://github.com/Microsoft/computerscience
Specific Big Data and HDInsight labs are at https://github.com/Microsoft/computerscience/tree/master/Labs/Big%20Data%20and%20Analytics
Azure HPC labs are at https://github.com/Microsoft/computerscience/tree/master/Labs/Azure%20Services/Azure%20HPC
We also have notebook labs at https://github.com/Microsoft/computerscience/tree/master/Labs/AI%20and%20Machine%20Learning