Textract missing some columns, tables

0

For certain documents, Textract appears to be missing some columns, skipping entire tables, or incorrectly identifying tables. I have attempted to use some different pre-processing methods such as extracting each page to an image file and rebuilding the PDFs, but this has not yielded any better results. Any suggestions are appreciated. Here are some examples, you can see the tables as identified by Textract highlighted in red:

Example #1 Example #2

behal
gefragt vor 8 Monaten235 Aufrufe
1 Antwort
0

Thank you for using Amazon Textract. As a managed machine learning service, we are continuously improving the quality of our models and releasing new features. In order to help us improve our models for your documents, please open a customer support ticket and share details to help us analyze further. Additionally, please look out for announcements regarding our model quality updates and new feature announcements that are announced on the AWS Textract what’s new post channel.

AWS
beantwortet vor 7 Monaten

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen