Explore how you can quickly prepare for, respond to, and recover from security events. Learn more.
Questions tagged with Amazon Textract
Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
343 results
Hello all,
cannot post a sample document in here, but lets say I'm working with invoices, pdf. All of a sudden, I found a couple odd balls today where my scripts that consume textract will consistent...
Hi AWS team,
I uploaded a PDF file using the online demo (TABLES) and I got more accurate results than using the AWS async API java.
`<dependency>
<groupId>software.amazon.awssdk</groupId>
...
I am able to extract text from my multi-page PDF using Amazon Textract. Now I want start Human Loop Review. I have already created a workflow and specified the condition there to trigger the Human Loo...
I am very new to Textract, so apologies if this is a basic question. I have a document that mixes tables and text throughout, and I want to extract both at once. However, I find that in the "layout" o...
Using Textract for a table of contents where each line has** TITLE . . . . Author PageNo.**
Resultant table has Title and Author merges ignoring dot-leader as one column and page numbers has 2nd c...
I have set up a user in the IAM Identity Center console that is assigned to a group that I'd like to only have access to a few select s3 buckets and the AWS Textract service. I've created a group with...
I have been using AWS Textract to scan forms and invoices. Previously, I trained the adapter after auto-labelling it and reviewing annotations.
I wanted to prepare my own data components using already...
Hi,
I have a multi-page PDF document which I can process fine and extract key value pair in Amazon Textract web interface. However, when I try to extract key value pairs in my Python code, it returns ...
Hey guys, just wondering if it is possible to train the Textract to return meaningful results. I am trying to use Textract to read some handwritten forms, but sometimes it gives me results that are no...
We are using AWS Textract modules Analyse Document to extract data from enrollment forms which are in pdf and jpeg format. We observed that for online filled pdf forms, textract is giving incorrect re...
Hello there,
I am working with serverless-offline, so I run my project with sls offline, all good with that. Here is my serverless.yml:
```
service: ${env:APP_SERVICE_NAME}
useDotenv: true
configV...
Hello,
My code breaks every time I attempt to analyze a PDF with more than one page. It displays the following error:
```
UnsupportedDocumentException: Request has unsupported document format
...