How streaming data is processed in real time projects , which was ingested through azure event hubs

Prakash Mj 20 Reputation points
2025-03-10T21:16:06.5233333+00:00

And I have few questions :

  1. How streaming data is processed in real time projects?, lets say my streaming data is stored in adls gen-2, where every minute a new blob file is created.
  2. Does it should be treated as batch data and should process incrementally? or can i use spark streaming concept or trigger pipeline using even based trigeer- after every blob creation?
Azure Stream Analytics
Azure Stream Analytics
An Azure real-time analytics service designed for mission-critical workloads.
384 questions
{count} votes

Accepted answer
  1. Deepanshu katara 14,580 Reputation points MVP
    2025-03-11T05:55:19.46+00:00

    Hello Prakash , Welcome to MS Q&A

    To process streaming data stored in Azure Data Lake Storage Gen2, you have several options depending on your requirements for latency and processing complexity:

    Batch Processing with Incremental Loads: This approach treats the data as batch data, processing it incrementally. It's suitable if real-time processing isn't critical and some latency is acceptable.

    Real-Time Processing with Spark Streaming: Apache Spark Streaming can be used to process data in real-time. It reads data from ADLS Gen2 as it arrives, allowing for low-latency processing and immediate reaction to events.

    Event-Driven Processing with Azure Functions or Logic Apps: You can set up an event-based trigger that activates a pipeline or function every time a new blob is created. Azure Event Grid can trigger Azure Functions or Logic Apps to process the new data, which is useful for event-driven architectures.

    For more detailed guidance, you can refer to the following resources:

    Please let me know if any further ques

    Kindly accept answer if it helps

    Thanks

    Deepanshu

    You found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.