Questions tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

329 results
I am very new to Textract, so apologies if this is a basic question. I have a document that mixes tables and text throughout, and I want to extract both at once. However, I find that in the "layout"...
1
answers
0
votes
326
views
cwasims
asked 7 months ago
Using Textract for a table of contents where each line has** TITLE . . . . Author PageNo.** Resultant table has Title and Author merges ignoring dot-leader as one column and page numbers has 2nd...
2
answers
0
votes
242
views
asked 7 months ago
I have set up a user in the IAM Identity Center console that is assigned to a group that I'd like to only have access to a few select s3 buckets and the AWS Textract service. I've created a group with...
1
answers
0
votes
647
views
asked 7 months ago
I have been using AWS Textract to scan forms and invoices. Previously, I trained the adapter after auto-labelling it and reviewing annotations. I wanted to prepare my own data components using already...
1
answers
1
votes
800
views
asked 7 months ago
Hi, I have a multi-page PDF document which I can process fine and extract key value pair in Amazon Textract web interface. However, when I try to extract key value pairs in my Python code, it returns...
0
answers
1
votes
181
views
sgral
asked 7 months ago
Hey guys, just wondering if it is possible to train the Textract to return meaningful results. I am trying to use Textract to read some handwritten forms, but sometimes it gives me results that are...
Accepted AnswerAmazon Textract
1
answers
0
votes
552
views
Aaron
asked 7 months ago
We are using AWS Textract modules Analyse Document to extract data from enrollment forms which are in pdf and jpeg format. We observed that for online filled pdf forms, textract is giving incorrect...
1
answers
0
votes
246
views
asked 7 months ago
Hello there, I am working with serverless-offline, so I run my project with sls offline, all good with that. Here is my serverless.yml: ``` service: ${env:APP_SERVICE_NAME} useDotenv:...
0
answers
0
votes
206
views
GerLC
asked 7 months ago
Hello, My code breaks every time I attempt to analyze a PDF with more than one page. It displays the following error: ``` UnsupportedDocumentException: Request has unsupported document format ...
Accepted AnswerAmazon Textract
1
answers
0
votes
206
views
asked 7 months ago
Hi everyone, this is my first postQuestion, so im sorry if i broke any rule 😅. I try to extract some text from a PDF file and when the script run i have this message error with queries method analyze...
Accepted AnswerAmazon Textract
1
answers
0
votes
454
views
profile picture
Claudio
asked 7 months ago
We use a custom tag (cost-center) to better understand our AWS expenses. Is there a way to add a tag to a textract detect_document_text() call? The AWS Cost Explorer included Textract as a service...
1
answers
0
votes
473
views
marcosw
asked 7 months ago
I have a Textract/A2I process setup and it works as expected. However, I need to change the workflow and am looking for suggestions. Context: we are using Textract/A2I to process historical...
2
answers
0
votes
696
views
Patrick
asked 8 months ago