Hi @Aleksandr Maxermis
Thanks for the question and for using the Microsoft Q&A platform.
As I understand it, you have already verified that the network is configured correctly and that the test connection between Data Factory and the Spark cluster succeeds. Since the script runs successfully when you submit it through Livy, the issue may be in the Spark activity configuration in your pipeline.
Here are some things you can check to troubleshoot the issue:
- Verify that the Spark activity is configured correctly in your pipeline. Make sure that the linked service for the Spark cluster and the script location (root path and entry file path) are specified correctly in the activity settings.
- Check the logs for the Spark activity in Data Factory. The logs may provide more information about the error that is occurring.
- Check the permissions on the storage account where the script is located. Make sure that the identity or credentials that Data Factory and the cluster use to reach that storage account are allowed to read the script.
- Check the version of Spark that the cluster is running. Make sure that this version is supported by the Data Factory Spark activity.
- Try running a different script using the Spark activity to see if the issue is specific to the script you are using.
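To make the first two checks more concrete, here is a minimal sketch in Python that inspects a Spark activity definition exported from the pipeline JSON. It assumes the HDInsight Spark activity described in the reference below, where `type` is `HDInsightSpark` and `rootPath` and `entryFilePath` sit under `typeProperties`. The helper name `check_spark_activity` and the sample values are hypothetical and not part of any Azure SDK, so please adapt them to your own pipeline.

```python
# Sketch only: check_spark_activity is a hypothetical helper, not an Azure SDK call.
# Property names assume the HDInsight Spark activity JSON from the linked documentation.
EXPECTED_TYPE = "HDInsightSpark"


def check_spark_activity(activity):
    """Return a list of human-readable problems found in a Spark activity definition."""
    problems = []

    # The activity type identifies it as a Spark activity.
    if activity.get("type") != EXPECTED_TYPE:
        problems.append(
            "type should be %r, got %r" % (EXPECTED_TYPE, activity.get("type"))
        )

    # The linked service points the activity at the Spark cluster.
    if not activity.get("linkedServiceName", {}).get("referenceName"):
        problems.append("linkedServiceName.referenceName is missing")

    # rootPath and entryFilePath together locate the script in storage.
    props = activity.get("typeProperties", {})
    for key in ("rootPath", "entryFilePath"):
        if not props.get(key):
            problems.append("typeProperties.%s is missing or empty" % key)

    # Without debug info, the Spark logs are not kept for later inspection.
    if props.get("getDebugInfo", "None") == "None":
        problems.append("getDebugInfo is 'None'; consider 'Failure' to keep logs")

    return problems


if __name__ == "__main__":
    # Hypothetical activity definition with placeholder names and paths.
    sample = {
        "name": "SparkActivity",
        "type": "HDInsightSpark",
        "linkedServiceName": {
            "referenceName": "MyHDInsightLinkedService",
            "type": "LinkedServiceReference",
        },
        "typeProperties": {"rootPath": "adfspark", "entryFilePath": ""},
    }
    for problem in check_spark_activity(sample):
        print(problem)
```

An empty result only means the definition has the expected shape. It does not prove that the path exists in storage or that the cluster can read it, so the permission check above still applies.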
If none of these steps resolve the issue, you may need to contact Microsoft support for further assistance.
Reference: https://learn.microsoft.com/en-us/azure/data-factory/transform-data-using-spark
I hope this helps! Let me know if you have any further questions.