Automating Export of Parquet Schema Elements from Purview Using PyApacheAtlas
How can the fully qualified name, classifications, sensitivity labels, glossary terms, and column descriptions for each column be exported from an Azure Data Lake Storage Gen2 Resource Set within a scanned collection?
After the scan completes, the goal is to download all attributes for specific assets into an Excel file, so that extra information can be added there and later uploaded back using PyApacheAtlas.
Efforts have been made to use get_entity, which requires a GUID for each asset, but manually retrieving this information is tedious and inefficient.
Is there a method to automate this process to export the required details for all assets into an Excel file for seamless updates and re-uploading?
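To make the desired output concrete, here is a minimal sketch of the flattening step, assuming the entity data has already been fetched and is shaped like an Atlas entity response (an "entities" list plus a "referredEntities" map). The helper names (entity_to_row, response_to_rows, rows_to_csv) and the column headers are hypothetical, chosen only for illustration. It also assumes descriptions live in the userDescription or description attribute and that glossary terms appear under relationshipAttributes.meanings; sensitivity labels are not covered because it is unclear where they appear in the response. The GUIDs themselves could probably be collected through a search call such as client.discovery.search_entities rather than by hand, but that part is not shown here.

```python
import csv
import io

# Hypothetical column headers for the exported sheet.
FIELDS = ["guid", "typeName", "qualifiedName", "name",
          "description", "classifications", "glossaryTerms"]


def entity_to_row(entity):
    """Flatten one Atlas-style entity dict into a flat row dict."""
    attrs = entity.get("attributes") or {}
    rel = entity.get("relationshipAttributes") or {}
    return {
        "guid": entity.get("guid", ""),
        "typeName": entity.get("typeName", ""),
        "qualifiedName": attrs.get("qualifiedName", ""),
        "name": attrs.get("name", ""),
        # Assumption: the description is in one of these two attributes.
        "description": attrs.get("userDescription") or attrs.get("description") or "",
        "classifications": ";".join(
            c.get("typeName", "") for c in entity.get("classifications") or []
        ),
        # Assumption: assigned glossary terms are listed under "meanings".
        "glossaryTerms": ";".join(
            m.get("displayText", "") for m in rel.get("meanings") or []
        ),
    }


def response_to_rows(response):
    """Turn a get_entity-shaped response into one row per entity,
    including referred entities (for example, schema columns)."""
    entities = list(response.get("entities") or [])
    entities += list((response.get("referredEntities") or {}).values())
    return [entity_to_row(e) for e in entities]


def rows_to_csv(rows):
    """Serialize rows to CSV text, which Excel can open directly."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

The idea would be to loop over the GUIDs in batches, pass each response through response_to_rows, and write the combined rows to a single file. The guid column is kept so that edited rows can be matched back to their entities on re-upload.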