Explore how you can quickly prepare for, respond to, and recover from security events. Learn more.
Questions tagged with Amazon Textract
Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
344 results
Hello
Amaqzon Textract is quite a useful service to use.
Pricing is clearly articulated here - https://aws.amazon.com/textract/pricing/
Only challenge is how to track the utilization.
When I look ...
1. Assume we've 50-250 data points that need to be extracted from PDF files. Each PDF file may be 4-15 pages.
2. The format and layout of each PDF file may be different. A datapoint we're searching fo...
Amazon Textract's pricing page says that the Free Tier lasts for 3 months, but does it count 3 full months from the time you have the service? Or is it 3 calendar months? I started a free trial in the...
I am developing a Bank Statement PDF Converter that converts PDF files into CSV format. I have resolved many bugs in the output and am nearing completion.
However, during testing with sample files, I...
Issue: textract identifies enumerated paragraph number as a different layout box, and misses the enumeration
Example:
![phone contract example - `layout text 2` should be part of `layout text `](/medi...
I'm implementing a form data capturer from images using AWS Textract, but ...
![Simple checkbox form](/media/postImages/original/IM8vx6TyPGRNCBEe4HcTl2ng)
As you can see in the AWS test page, the mo...
Seems like Textract is having problem extracting this part of the table (green box)
![Enter image description here](/media/postImages/original/IMugtZUKWjSDCAfESDxTv8IQ)
The only data that Textract s...
I am attempting to use TextractAsyncClient in Kotlin, but I can't find it. I can find it in Java and can make do if necessary, but is there really no native Kotlin implementation?
I am using the Java Textract SDK to evaluate Textract, and the documentation states this:
*Amazon Textract can be used to detect the layout of a document by finding the locations of different elemen...
My PDF files have a table of "testers". In the image below, we can see "Tester 2" is "SENT-11-3".
![PDF Excerpt](/media/postImages/original/IMNJo-biEQStmkW2qMVRSNqA)
However, the TEXTRACT output giv...
Periodically for some documents (PDFs) but I'm not able to understand why. Textract will not be able to determine the layout even though the layout information is in the blocks.
For example:
```
new...
Hi,
I'm using Textract to read certain forms where Textract mixes up these cases -
1 & I
0 & O
S & 5
Z & 2
i & 1
....
Can some one clarify if there are any fonts in which the Textract would mix up t...