Questions tagged with Amazon Textract
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I uploaded a sample document to the AWS Textract demo web interface via our in-house portal, which consumes AWS Textract via the synchronous AnalyzeDocument API, and the table results provided by the...
1
answers
0
votes
292
views
asked a year agolg...
Hello. Using Textract, I am able to OCR extract the data in the tables / forms to json downloads. However, I have observed that in json file, every word and statement in a line has a unique ID...
3
answers
0
votes
1333
views
asked a year agolg...
I was testing out the latency of Textract + HITL via Mechanical Turk.
I created a flow defintion, a human loop config and then submitted a request on a document using:
```
humanLoopConfig = {
...
1
answers
0
votes
252
views
asked a year agolg...
I am trying to extract and parse an invoice pdf that has tabular data, using Python. The table has a few columns that have rowspan of 2. Textract is unable to fetch the text for such rows. In the...
1
answers
0
votes
298
views
asked a year agolg...
I am getting a `INVALID_DOCUMENT_TYPE` error when trying to process a given PDF with Textract even though the PDF is only 1MB. However, the PDF is about 105"x35" which I know is greater than the...
1
answers
0
votes
250
views
asked a year agolg...
Hi I am trying to feed a document with text and three column table with header Yes and No. I have tried multiple combinations of queries too as well as table analysis but all came blank.
![Enter...
1
answers
0
votes
253
views
asked a year agolg...
Has anyone found a good way to parse tables that come back from Textract that are inverted? Specifically, getting the table name or column headers as they are not recognized via the API itself. Here...
0
answers
0
votes
79
views
asked a year agolg...
I have recently seen the news on the console Textract tool for [Bulk Upload](https://docs.aws.amazon.com/textract/latest/dg/bulk-uploader-best-practices.html). Is there a way to use this service via...
2
answers
0
votes
517
views
asked a year agolg...
Hi everyone,
I am experiencing an issue with the query feature of the Textract Analyze Document service. When I scan a black image and ask for the date and time of the document, Textract gives me a...
1
answers
0
votes
268
views
asked a year agolg...
How does Textract process PDFs with searchable and selectable text? Compared to the "scanned" PDFs?lg...
I couldn't find information if Textract working differently with these PDFs.
I ponder if there is even a need for Textract if PDF already contains text (which is typically the case for machine...
1
answers
0
votes
282
views
asked a year agolg...
this is a sample pdf of my data and all the pdfs have same format:
![Enter image description here](/media/postImages/original/IM35LB0AWPQSuhGSqkntlIhA)
Now I am first using textract on this and then...
1
answers
0
votes
264
views
asked a year agolg...
Hi,
Are there any tutorials or other resources available which demonstrate how to structure and load AWS Textract results in a vector database?
1
answers
0
votes
361
views
asked a year agolg...