Jaa


Streaming on Azure Databricks

You can use Azure Databricks for near real-time data ingestion, processing, machine learning, and AI for streaming data.

Azure Databricks offers numerous optimizations for streaming and incremental processing, including the following:

Delta Lake provides the storage layer for these integrations. See Delta table streaming reads and writes.

For real-time model serving, see Deploy models using Mosaic AI Model Serving.

Azure Databricks has specific features for working with semi-structured data fields contained in Avro, protocol buffers, and JSON data payloads. To learn more, see:

Additional resources

Apache Spark provides a Structured Streaming Programming Guide that has more information about Structured Streaming.

For reference information about Structured Streaming, Databricks recommends the following Apache Spark API references: