Hello, good afternoon,
Thank you for your question. There is a library published in AWS Samples, Amazon Textract Textractor, that may help. Link: https://github.com/aws-samples/amazon-textract-textractor?tab=readme-ov-file
It has the following submodules:
- amazon-textract-caller (to simplify calling Amazon Textract without additional dependencies)
- amazon-textract-response-parser (to parse the JSON response returned by Textract APIs)
- amazon-textract-overlayer (to draw bounding boxes around the document entities on the document image)
- amazon-textract-prettyprinter (convert Amazon Textract response to CSV, text, markdown, ...)
- amazon-textract-geofinder (extract specific information from a document with methods that help navigate the document using geometry and relations, e.g. hierarchical key/value pairs)
You can probably use amazon-textract-response-parser to separate the non-table data. Check this link: https://pypi.org/project/amazon-textract-response-parser/
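To illustrate the idea, here is a minimal sketch that works directly on the raw Textract JSON rather than through the parser library. It assumes the usual response layout, where `LINE` and `CELL` blocks both list their `WORD` blocks through `CHILD` relationships, and it keeps only the lines that share no word with any table cell. The function names and the sample response are hypothetical and hand-made for illustration; they are not part of any AWS library.

```python
def child_ids(block):
    """Return the IDs listed in a block's CHILD relationships."""
    ids = []
    for rel in block.get("Relationships") or []:
        if rel.get("Type") == "CHILD":
            ids.extend(rel.get("Ids", []))
    return ids


def non_table_lines(response):
    """Return the text of LINE blocks that share no WORD with any table CELL."""
    blocks = response.get("Blocks", [])

    # Collect the IDs of every word that sits inside a table cell.
    table_word_ids = set()
    for block in blocks:
        if block.get("BlockType") == "CELL":
            table_word_ids.update(child_ids(block))

    # Keep only the lines whose words are all outside tables.
    lines = []
    for block in blocks:
        if block.get("BlockType") != "LINE":
            continue
        if not table_word_ids.intersection(child_ids(block)):
            lines.append(block.get("Text", ""))
    return lines


# Hand-made sample for illustration only (not real Textract output).
sample = {
    "Blocks": [
        {"BlockType": "LINE", "Id": "l1", "Text": "Invoice 123",
         "Relationships": [{"Type": "CHILD", "Ids": ["w1", "w2"]}]},
        {"BlockType": "LINE", "Id": "l2", "Text": "Item Qty",
         "Relationships": [{"Type": "CHILD", "Ids": ["w3", "w4"]}]},
        {"BlockType": "WORD", "Id": "w1", "Text": "Invoice"},
        {"BlockType": "WORD", "Id": "w2", "Text": "123"},
        {"BlockType": "WORD", "Id": "w3", "Text": "Item"},
        {"BlockType": "WORD", "Id": "w4", "Text": "Qty"},
        {"BlockType": "TABLE", "Id": "t1",
         "Relationships": [{"Type": "CHILD", "Ids": ["c1", "c2"]}]},
        {"BlockType": "CELL", "Id": "c1",
         "Relationships": [{"Type": "CHILD", "Ids": ["w3"]}]},
        {"BlockType": "CELL", "Id": "c2",
         "Relationships": [{"Type": "CHILD", "Ids": ["w4"]}]},
    ]
}

print(non_table_lines(sample))
```

A line that only partly overlaps a table is treated as table data here; if your documents have text running alongside tables, you may need a stricter rule, for example comparing bounding boxes.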
Let me know if it helps.
Thank you.
Yes, thank you. I appreciate the help. I am familiar with the documentation and code samples. I have not come across anything yet that I recognized as a possible solution.