Questions tagged with Amazon Textract
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I am using Textract's asynchronous functions StartDocumentAnalysis and GetDocumentAnalysis to detect signatures on a document using AWS SDK Python. The JSON data I receive is correct from...
0
answers
0
votes
21
views
asked 2 days agolg...
I was trying to extract invoice number from PDF file (using Amazon Textract - Analyze Expense), I uploaded pdf file and then analyze but it returned this error UnsupportedDocumentException.
Then I...
2
answers
0
votes
41
views
asked 3 days agolg...
In several documents that we have attempted to process using Amazon's OCR feature, we have found that the system does not correctly detect text in some areas, despite it being clearly legible. We have...
0
answers
0
votes
54
views
asked 3 days agolg...
Hello all,
I am using Textract async StartDocumentAnalysis and GetDocumentAnalysis for detecting signatures. However, when I test the code with a PDF document the job status of GetDocumentAnalysis is...
1
answers
0
votes
27
views
asked 5 days agolg...
I have two lambda functions currently, one to specify the queries and start the document analysis. The second function is triggered by a SNS topic and retrieves the document analysis. The problem is...
1
answers
0
votes
77
views
asked 6 days agolg...
I'm trying to use Textract to extract the product descriptions form our PDF catalogs in page order. The Textract analysis picks up the descriptions as text blocks, but how do I go about training...
1
answers
0
votes
65
views
asked 15 days agolg...
I am extracting data from documents that include tables and other text that is not in table format (the documents do not include figures). I would like to separate table data from non-table data...
1
answers
0
votes
102
views
asked 19 days agolg...
Background: I am using Textract Analyze document API to detect Layout response objects in a PDF page. The page has Page Headers, Title, Sub-headers, tables, figures, and text. The page is divided into...
1
answers
0
votes
118
views
asked 19 days agolg...
Is there any way to use Analyze Expense when the receipt or bill is split into multiple images. I have tried combining the images into a single image but this didn't work as expected. I was getting...
1
answers
0
votes
98
views
asked 23 days agolg...
Hi,
Can I know what is the code behind pulling the text of the document in the demo console: https://us-west-2.console.aws.amazon.com/textract/home?region=us-west-2#/demo
The "rawText.txt" in the zip...
1
answers
0
votes
108
views
asked a month agolg...
Hello,
I'm using Textract for extracting information from academic papers. It's working great except for when Greek letters are used, like α, β, σ, etc. It will often translate α as *a*, and β will be...
2
answers
0
votes
109
views
asked a month agolg...
Hello all,
cannot post a sample document in here, but lets say I'm working with invoices, pdf. All of a sudden, I found a couple odd balls today where my scripts that consume textract will...
2
answers
0
votes
117
views
asked a month agolg...