[Azure Data Factory] Data Flow Sink is extremely slow

Question

[Azure Data Factory] Data Flow Sink is extremely slow

Hanna Melashkina 0

I'm copying about 800 items from SAP to Azure Fabric. I'm running pipeline with data flow in Azure Data Factory (ADF). We are using Self-hosted integration runtime. The Data Flow is not complex, it just copies data and adds an additional column. There are no complex transformations.

However, the Sink activity takes around 10 minutes!

Sink configuration:

Sink type: dataset
Dataset: Fabric warehouse
Allow schema drift: true
Allow insert: true
Batch size: 100
Enable staging: false (I cannot turn it on; otherwise, I get an error in my pipeline: "Fabric DW supports staging only via copy command - EXE-0001, Dataflow cannot be analyzed as a graph.")

I also tried using Copy activity as suggested here, but it keeps failing with a BULK insert error:

Failure happened on 'Sink' side. ErrorCode=SqlOperationFailed, 'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException, Message=A database operation failed. Please search error to get more details., Source=Microsoft.DataTransfer.Connectors.MSSQL, ''Type=Microsoft.Data.SqlClient.SqlException, Message=Incorrect syntax near 'INSERT BULK'., Source=Framework Microsoft SqlClient Data Provider,'

Some other observations:

Sink processing time: 10m 33s

Stage time: 7m 36s

Table operations SQL duration: 2m 30s

I also checked in the warehouse's long_running_queries and saw that INSERT was called 840 times, with each insert taking around 450ms. That means 800 * 450ms = 360,000ms (6 minutes)!

My Question: Why is ADF inserting items one by one and ignoring the batch size of 100? Since I cannot enable Staging without errors and BULK insert fails, what is the correct configuration to load data from SAP to Fabric efficiently?

2 answers

Your answer

Answer 1

The slow performance you are experiencing with the Data Flow Sink in Azure Data Factory (ADF) could be attributed to several factors. Since you mentioned that the sink is inserting items one by one despite the batch size configuration, here are some considerations and suggestions to improve the performance:

Batch Size Configuration: Ensure that the batch size is set correctly in the sink settings. Although you mentioned that the batch size is set to 100, if the sink is not configured to handle batch inserts effectively, it may still process each insert individually.
Staging: Since you cannot enable staging due to the error you encountered, it limits the ability to optimize the data loading process. Staging can significantly improve performance by allowing bulk operations. If possible, investigate the error further to see if there is a way to resolve it or consider using a different approach that allows staging.
Copy Activity: You mentioned trying the Copy activity but encountering a BULK insert error. This might indicate that the configuration for the Copy activity is not set up correctly. Ensure that the allowCopyCommand property is set to true, which allows the use of the COPY statement for loading data into the Fabric Warehouse. This can help in achieving better performance.
Error Handling: Review the error handling settings in your Data Flow. If the flow is configured to stop on errors, it may be causing delays. Consider enabling options that allow the flow to continue processing even if some rows encounter errors.
Performance Monitoring: Utilize the monitoring tools available in Azure Data Factory to identify bottlenecks. Check for any throttling issues or resource constraints that may be affecting the performance of your integration runtime.
Parallelism: If your source data allows, consider increasing the degree of parallelism in your data flow. This can help in processing multiple records simultaneously, reducing the overall time taken for the sink operations.

By addressing these areas, you may be able to improve the performance of your data loading process from SAP to Azure Fabric.

References:

Answer 2

Hi Hanna Melashkina,

SAP is not visible as a source in ADF Copy Activity because the SAP connector you are using is supported in Mapping Data Flow (Spark runtime) and not exposed in the Copy Activity data movement runtime. This is a product capability difference, not a configuration issue.

Because of this, SAP extraction must be done using Mapping Data Flow in your environment.

When Mapping Data Flow writes directly to Fabric Warehouse, it cannot use Fabric’s required COPY INTO ingestion path. As a result, Data Flow falls back to row-by-row INSERT statements, and the batch size setting does not enforce bulk loading.

This is why you observed:

Hundreds of INSERT statements

~10 minutes sink time for ~800 rows

You must use a two-step pipeline:

Extract from SAP(Mapping Data Flow)
Source: SAP (via Self-Hosted Integration Runtime)
Sink: ADLS Gen2 (Parquet recommended, CSV supported)
Minimal transformation (for example, add the extra column here)

Load Into Fabric Warehouse (Copy Activity)
Source: ADLS Gen2 files from Step 1
Sink: Fabric Warehouse (WarehouseSink)
Enable COPY command (allowCopyCommand = true)

This allows Fabric to use server-side COPY INTO, which performs a true bulk load and avoids row-by-row INSERTs.

For your reference, please review the following Microsoft Learn documentation. These articles provide the official guidance for SAP connectors, Fabric Warehouse ingestion, and the supported COPY INTO pattern that applies to your scenario:

https://learn.microsoft.com/en-us/azure/data-factory/industry-sap-connectors https://learn.microsoft.com/en-us/azure/data-factory/sap-change-data-capture-shir-preparation
https://learn.microsoft.com/en-us/azure/data-factory/connector-microsoft-fabric-warehouse?tabs=data…
https://learn.microsoft.com/en-us/fabric/data-warehouse/ingest-data-copy https://learn.microsoft.com/en-us/azure/data-factory/copy-activity-overview

Hope this helps, Please let us know if you have any questions and concerns.

Manoj Kumar Boyini 5,795 Reputation points Microsoft External Staff Moderator

2026-02-03T09:54:27.06+00:00

Hi Hanna Melashkina,

Kindly let us know if you are using Fabric or integrating Azure services with Microsoft Fabric, so we can guide you accordingly.
Hanna Melashkina 0 Reputation points

2026-02-03T10:04:29.9666667+00:00

Hi, @Manoj Kumar Boyini ,

I'm think I'm integrating Azure services with Microsoft Fabric. My initial goal is to get data from SAP to MS Fabric and process data in Fabric. For coping data I'm using Azure Data Factory.
Manoj Kumar Boyini 5,795 Reputation points Microsoft External Staff Moderator

2026-02-03T10:09:23.48+00:00

Hi Hanna Melashkina,

I kindly request you to please share the details requested in the private message for further investigation.
Manoj Kumar Boyini 5,795 Reputation points Microsoft External Staff Moderator

2026-02-04T12:57:35.0533333+00:00

Hi Hanna Melashkina,

I hope you had a chance to review the information shared earlier, and I hope this information has been helpful! If you still have questions, please let us know what is needed in the comments so the question can be answered.

Share via

[Azure Data Factory] Data Flow Sink is extremely slow

2 answers

Your answer