I’ve dealt with similar issues when working with closely spaced columns in tables using Azure Document Intelligence. Here are a few suggestions that might improve your model's accuracy:
Preprocessing: If possible, try pre-processing your documents to enhance table structure clarity. This can include increasing the spacing between columns or adding lines that make column separation more evident. If you’re able to adjust the source documents before extraction, this can help the model distinguish columns more effectively.
Region Labeling: When training your model, ensure that you’re using precise region labeling for each column. If the columns are too close, the model may interpret them as a single region. You could try breaking down the table into smaller labeled sections and see if that improves the accuracy.
Table Structure Configuration: Look into configuring the table structure recognition settings. Azure Document Intelligence allows some customization to better identify table boundaries and column breaks. You may need to adjust settings related to cell boundaries, merging logic, or how the model handles grid recognition for closely spaced tables.
Multiple Passes for Extraction: Consider running multiple passes with different extraction models or combining table extraction with form extraction. Sometimes breaking down a table into parts or using a hybrid approach for different sections can yield better results.
Post-Processing: You might also want to implement post-processing logic after extraction to clean up or restructure the data. This could include splitting merged cells or adjusting column positions in your data based on certain rules.
Document Layout Changes: In cases where document layout flexibility exists, slightly adjusting the layout by increasing column spacing or using borders could make a significant difference in extraction accuracy.
It might take a combination of these methods to achieve the best results, but refining your model’s labeling and exploring these options should help improve its performance.