Will Document Intelligence Custom Extraction support Microsoft Office Documents in the future?

System Administrator 20 Reputation points Microsoft Employee
2025-02-05T13:23:05.6266667+00:00

Customer wants to know whether Document Intelligence Custom extraction will support .docx files.

If this forum is not the proper place to post this question then how can I get a Roadmap for Document Intelligence features?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,895 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Saideep Anchuri 1,790 Reputation points Microsoft Vendor
    2025-02-05T15:38:12.0433333+00:00

    Hi System Administrator

    Welcome to Microsoft Q&A Forum, thank you for posting your query here!

    Currently, Document Intelligence does not support Microsoft Office files, including .docx, for Custom Extraction. The supported file formats are primarily PDFs and images (JPEG, PNG, BMP, TIFF, HEIF). Workaround will be converting all docx file to .pdf files using python libraries (python-docx, pdfkit) as pdfs are compatible with custom extraction. There is no indication in the available context that Microsoft Office document support will be added in the future. However, you can keep an eye on the whats-new

    Kindly refer below link: custom-model

    Hope this helps. Do let us know if you any further queries.

     


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.

    Thank You.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.