Questions tagged with Amazon Textract
I was testing out the latency of Textract + HITL via Mechanical Turk.
I created a flow definition and a human loop config, then submitted a request on a document using:
```
humanLoopConfig = {
...
```
1 answer, 0 votes, 245 views, asked a year ago
I am trying to extract and parse an invoice PDF that has tabular data, using Python. The table has a few columns with a rowspan of 2. Textract is unable to fetch the text for such rows. In the...
1 answer, 0 votes, 283 views, asked a year ago
I am getting an `INVALID_DOCUMENT_TYPE` error when trying to process a given PDF with Textract, even though the PDF is only 1 MB. However, the PDF is about 105"x35", which I know is greater than the...
1 answer, 0 votes, 237 views, asked a year ago
Hi, I am trying to feed in a document with text and a three-column table with Yes and No headers. I have tried multiple combinations of queries as well as table analysis, but all of them came back blank.
1 answer, 0 votes, 243 views, asked a year ago
Has anyone found a good way to parse tables that come back from Textract inverted? Specifically, I need to get the table name or column headers, since the API itself does not recognize them. Here...
0 answers, 0 votes, 76 views, asked a year ago
I recently saw the news about the [Bulk Upload](https://docs.aws.amazon.com/textract/latest/dg/bulk-uploader-best-practices.html) tool in the Textract console. Is there a way to use this service via...
2 answers, 0 votes, 496 views, asked a year ago
Hi everyone,
I am experiencing an issue with the query feature of the Textract Analyze Document service. When I scan a black image and ask for the date and time of the document, Textract gives me a...
1 answer, 0 votes, 254 views, asked a year ago
How does Textract process PDFs with searchable and selectable text, compared to "scanned" PDFs?
I couldn't find any information on whether Textract works differently with these PDFs.
I wonder whether there is even a need for Textract if the PDF already contains text (which is typically the case for machine...
1 answer, 0 votes, 269 views, asked a year ago
This is a sample PDF of my data; all the PDFs have the same format:
![Enter image description here](/media/postImages/original/IM35LB0AWPQSuhGSqkntlIhA)
I am first using Textract on this and then...
1 answer, 0 votes, 255 views, asked a year ago
Hi,
Are there any tutorials or other resources that demonstrate how to structure AWS Textract results and load them into a vector database?
1 answer, 0 votes, 348 views, asked a year ago
Hi,
For PDFs with columns, such as this:
![screenshot of german language pdf with columns](/media/postImages/original/IMtsWfCpfYQo6dJek-soEacw)
The Textract result reads the text left to right across...
1 answer, 0 votes, 218 views, asked a year ago
Hi,
I'm trying to run the Textract API on a document, but it returns an error: botocore.errorfactory.UnsupportedDocumentException: An error occurred (UnsupportedDocumentException) when calling the...
2 answers, 0 votes, 873 views, asked a year ago