Data flow failed due to transient error

Ananya patil 20 Reputation points
2024-12-31T02:11:17.29+00:00

The ADF data flow failed due to a transient error, and during the retry, some data was already loaded into the sink, leading to duplicate records in the insert scenario. Since I didn't implement CDC at the source level and there are no sink checkpoint options to prevent duplicates, is there a way to restart the job from the point of failure on the second attempt?

Your support would be appreciated.

Thanks,

Ananya

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
5,098 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,064 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Nandan Hegde 33,001 Reputation points MVP
    2024-12-31T04:13:42.5433333+00:00

    Unfortunately, based on my understanding;

    ADF/Synapse dataflows/copy activity are not transaction bound. And there is no way you can rerun from the point of failure w.r.t number of records and it would be from the beginning only.

    So you would have to revert your sink into original state prior to the execution (by deleting/updating the records) and rerun the job to overcome the issue


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.