- Support for Non-Microsoft Systems
Microsoft Purview primarily excels within the Microsoft ecosystem (e.g., Microsoft 365, SharePoint, Teams, Exchange) but does extend to some third-party cloud providers and on-premises environments. However, native, out-of-the-box support for platforms like SAP or Oracle is limited. Here's a detailed view of where Purview currently stands:
Cloud Platforms Supported:
- Native integrations with Google Drive and Dropbox.
- Ability to monitor and classify files synced from these platforms.
Third-Party Data Connectors:
- Microsoft offers data connectors (via Purview Data Map) that can collect metadata from sources like SQL databases, AWS S3, and some third-party systems.
- For SAP, Oracle, or custom enterprise systems, Purview typically requires custom connectors or third-party integration solutions.
On-Premises Data Sources:
- Data classification and protection extend to local file shares and on-premises SQL databases (via the Microsoft Purview Information Protection client and scanner, formerly the Azure Information Protection unified labeling client and scanner).
- For other enterprise platforms (e.g., SAP), customers typically rely on third-party vendors or APIs to integrate Purview.
Key Limitation: Because there is no native support for enterprise systems such as SAP and Oracle, or for private cloud platforms, customers must build or buy custom integrations to close these gaps.
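To give a rough idea of what a "custom connector" could push, here is a minimal sketch that only builds the JSON body for registering one asset. It assumes you go through the Data Map's Apache Atlas-compatible entity API. The helper name, the generic "DataSet" type, and the sap:// qualified name are illustrative assumptions on my part, not values taken from Microsoft documentation. Nothing is sent over the network.

```python
import json


def build_atlas_entity(type_name, qualified_name, name, description=""):
    """Build an Apache Atlas v2 style entity body (hypothetical helper)."""
    return {
        "entity": {
            "typeName": type_name,
            "attributes": {
                # qualifiedName must be unique per asset; the scheme used
                # below is an illustrative convention, not a Purview rule.
                "qualifiedName": qualified_name,
                "name": name,
                "description": description,
            },
        }
    }


# Example asset: an SAP table described with the generic Atlas "DataSet" type.
payload = build_atlas_entity(
    "DataSet",
    "sap://erp-prod/tables/KNA1",
    "KNA1",
    "Customer master table (illustrative)",
)
print(json.dumps(payload, indent=2))
```

A real connector would still need to authenticate, send this body to your account's endpoint, and most likely define a custom type instead of reusing "DataSet".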
- Supported File Formats for Classification and Protection
Microsoft Purview can classify and protect many standard file formats, but there are some limitations, particularly with non-standard or media-heavy formats. Here's a more precise breakdown:
File Types that Microsoft Purview Can Classify and Label:
- Microsoft formats: Word, Excel, PowerPoint, OneNote.
- Emails: Outlook (.msg, .eml).
- PDFs: Purview supports PDF classification, but only certain types of PDFs (those that allow tagging and text extraction).
- Text-based formats: TXT, CSV, XML, HTML.
- Standard image formats: JPEG, PNG, GIF (limited to detecting embedded text using Optical Character Recognition [OCR]).
File Types with Limited or No Support:
- Open Document formats: ODS, ODT, ODP.
- Multimedia files: AVI, MPEG, MP3, MP4 (cannot be classified or labeled by default).
- Scanned or protected PDFs: A scanned PDF that has not been OCR-processed cannot be classified, and neither can a PDF whose protection blocks text extraction.
- Proprietary formats: Files generated by SAP, Oracle, and other enterprise systems might not be classified natively.
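If you want to triage a file inventory against the two lists above before a rollout, something like the following sketch may help. The extension sets are my own transcription of those lists (the Office extensions in particular are assumed mappings for Word, Excel, PowerPoint, and OneNote), so treat them as a starting point and not as an official support matrix. The function name is hypothetical.

```python
from pathlib import PurePath

# Transcribed from the lists above; adjust to match your own findings.
SUPPORTED = {
    ".docx", ".xlsx", ".pptx", ".one",   # Microsoft formats (assumed extensions)
    ".msg", ".eml",                      # Outlook emails
    ".txt", ".csv", ".xml", ".html",     # text-based formats
}
# Only classifiable under conditions (text layer for PDFs, OCR for images).
CONDITIONAL = {".pdf", ".jpg", ".jpeg", ".png", ".gif"}
# Limited or no support by default.
LIMITED = {".ods", ".odt", ".odp", ".avi", ".mpeg", ".mp3", ".mp4"}


def triage(filename):
    """Return 'supported', 'conditional', 'limited', or 'unknown'."""
    ext = PurePath(filename).suffix.lower()
    if ext in SUPPORTED:
        return "supported"
    if ext in CONDITIONAL:
        return "conditional"
    if ext in LIMITED:
        return "limited"
    return "unknown"


for sample in ["report.DOCX", "scan.pdf", "clip.mp4", "export.bin"]:
    print(sample, "->", triage(sample))
```

Anything that comes back as "conditional" or "unknown" is worth testing by hand in your tenant, since the extension alone does not tell you whether a PDF has a text layer.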
Given this, your concerns are valid. Purview’s messaging around "all types of data" typically applies to supported file formats within the Microsoft ecosystem and select third-party platforms. Enterprise systems like SAP and Oracle will need custom integration, and some non-standard formats (e.g., multimedia files) cannot be classified or protected directly.
If the above response helps answer your question, remember to "Accept Answer" so that others in the community facing similar issues can easily find the solution. Your contribution is highly appreciated.
hth
Marcin