Questions tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

344 results
Hello, Amazon Textract is quite a useful service. Pricing is clearly articulated here - https://aws.amazon.com/textract/pricing/ The only challenge is how to track utilization. When I look ...
Accepted Answer · Amazon Textract · 1 answer · 0 votes · 26 views
asked 2 days ago
1. Assume we have 50-250 data points that need to be extracted from PDF files. Each PDF file may be 4-15 pages. 2. The format and layout of each PDF file may be different. A data point we're searching fo...
2 answers · 0 votes · 16 views
asked 9 days ago
Amazon Textract's pricing page says that the Free Tier lasts for 3 months, but does it count 3 full months from the time you have the service? Or is it 3 calendar months? I started a free trial in the...
1 answer · 0 votes · 20 views
asked 10 days ago
I am developing a Bank Statement PDF Converter that converts PDF files into CSV format. I have resolved many bugs in the output and am nearing completion. However, during testing with sample files, I...
1 answer · 0 votes · 19 views
asked 10 days ago
Issue: Textract identifies an enumerated paragraph number as a separate layout box and misses the enumeration. Example: ![phone contract example - `layout text 2` should be part of `layout text `](/medi...
Accepted Answer · Amazon Textract · 1 answer · 0 votes · 16 views
AWS
asked 12 days ago
I'm implementing a form data capturer from images using AWS Textract, but ... ![Simple checkbox form](/media/postImages/original/IM8vx6TyPGRNCBEe4HcTl2ng) As you can see in the AWS test page, the mo...
2 answers · 0 votes · 19 views
asked 19 days ago
It seems that Textract is having a problem extracting this part of the table (green box) ![Enter image description here](/media/postImages/original/IMugtZUKWjSDCAfESDxTv8IQ) The only data that Textract s...
1 answer · 0 votes · 34 views
asked 19 days ago
I am attempting to use TextractAsyncClient in Kotlin, but I can't find it. I can find it in Java and can make do if necessary, but is there really no native Kotlin implementation?
1 answer · 0 votes · 22 views
asked 19 days ago
I am using the Java Textract SDK to evaluate Textract, and the documentation states this: *Amazon Textract can be used to detect the layout of a document by finding the locations of different elemen...
2 answers · 1 vote · 43 views
asked 23 days ago
My PDF files have a table of "testers". In the image below, we can see "Tester 2" is "SENT-11-3". ![PDF Excerpt](/media/postImages/original/IMNJo-biEQStmkW2qMVRSNqA) However, the TEXTRACT output giv...
2 answers · 0 votes · 30 views
asked a month ago
Periodically, for some documents (PDFs), Textract is not able to determine the layout even though the layout information is in the blocks, and I'm not able to understand why. For example: ``` new...
1 answer · 0 votes · 36 views
asked a month ago
Hi, I'm using Textract to read certain forms where Textract mixes up these cases: 1 & I, 0 & O, S & 5, Z & 2, i & 1, .... Can someone clarify if there are any fonts in which the Textract would mix up t...
1 answer · 0 votes · 44 views
asked a month ago