Questions tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

343 results
Hello all, I cannot post a sample document here, but let's say I'm working with PDF invoices. All of a sudden, I found a couple of oddballs today where my scripts that consume Textract will consistent...
2 answers, 0 votes, 264 views, asked 9 months ago
Hi AWS team, I uploaded a PDF file using the online demo (TABLES) and got more accurate results than with the AWS async API in Java. `<dependency> <groupId>software.amazon.awssdk</groupId> ...
Accepted Answer (tag: Amazon Textract)
1 answer, 0 votes, 227 views, asked 9 months ago
I am able to extract text from my multi-page PDF using Amazon Textract. Now I want to start a Human Loop review. I have already created a workflow and specified the condition there to trigger the Human Loo...
1 answer, 0 votes, 270 views, asked 9 months ago
I am very new to Textract, so apologies if this is a basic question. I have a document that mixes tables and text throughout, and I want to extract both at once. However, I find that in the "layout" o...
1 answer, 0 votes, 355 views, asked 9 months ago
Using Textract for a table of contents where each line has **TITLE . . . . Author PageNo.** The resulting table merges Title and Author into one column, ignoring the dot leader, and has the page numbers as the 2nd c...
2 answers, 0 votes, 267 views, asked 9 months ago
I have set up a user in the IAM Identity Center console that is assigned to a group that I'd like to only have access to a few select S3 buckets and the AWS Textract service. I've created a group with...
1 answer, 0 votes, 770 views, asked 9 months ago
I have been using AWS Textract to scan forms and invoices. Previously, I trained the adapter after auto-labelling it and reviewing annotations. I wanted to prepare my own data components using already...
1 answer, 1 vote, 837 views, asked 9 months ago
Hi, I have a multi-page PDF document which I can process fine, extracting key-value pairs in the Amazon Textract web interface. However, when I try to extract key-value pairs in my Python code, it returns ...
0 answers, 1 vote, 187 views, asked 9 months ago
Hey guys, just wondering if it is possible to train Textract to return meaningful results. I am trying to use Textract to read some handwritten forms, but sometimes it gives me results that are no...
Accepted Answer (tag: Amazon Textract)
1 answer, 0 votes, 646 views, asked 9 months ago
We are using the AWS Textract Analyze Document module to extract data from enrollment forms, which are in PDF and JPEG format. We observed that for PDF forms filled in online, Textract is giving incorrect re...
1 answer, 0 votes, 275 views, asked 9 months ago
Hello there, I am working with serverless-offline, so I run my project with `sls offline`, and all is good with that. Here is my serverless.yml: ``` service: ${env:APP_SERVICE_NAME} useDotenv: true configV...
0 answers, 0 votes, 209 views, asked 9 months ago
Hello, my code breaks every time I attempt to analyze a PDF with more than one page. It displays the following error: ``` UnsupportedDocumentException: Request has unsupported document format ...
Accepted Answer (tag: Amazon Textract)
1 answer, 0 votes, 244 views, asked 9 months ago