Questions tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

343 results
I have asked a similar question, but this more straight forward. Can I use Amazon Textract to detect strikethrough / strikeout characters in a PDF? And if so, how? And if not, can these feature be ...
1
answers
0
votes
301
views
asked 6 months ago
Could you please tell me the pricing based differences of Textract if I send 30 pages asynchronously for Textract analysis and if I send each page of the pdf as a single pdf for analysis. Also could y...
1
answers
0
votes
275
views
asked 6 months ago
I am trying to follow the guidance from this AWS Textract article: https://aws.amazon.com/blogs/machine-learning/amazon-textracts-new-layout-feature-introduces-efficiencies-in-general-purpose-and-ge...
1
answers
0
votes
388
views
asked 6 months ago
I've been trying to analize a document. I have tried with S3 buckets as with Base64 image upload with no luck at all. The image is uploaded via a web browser, it is commented the section where it trie...
0
answers
0
votes
213
views
asked 6 months ago
I want to send a complete pdf document to Textract, some pages have tables and some don't how can I send the entire document such that only tables are extracted and CSVs are generated (CSV generation ...
1
answers
0
votes
236
views
asked 6 months ago
I am trying to create my own browser-based console that I can use instead of the AWS Custom Queries console to create and manage Textract adapters. I've found [this answer](https://repost.aws/questio...
2
answers
0
votes
228
views
asked 6 months ago
AWS textract is superb and gives good results for documents which contains images. However what we observed is that everytime there is **c.** like in attached image ![Sample Image](/media/postImages/o...
1
answers
0
votes
184
views
asked 6 months ago
Hi AWS, we are working on a project that requires real-time document processing and we are encountering latency issues with AWS Textract for multipage, large PDF files. Despite using the asynchronous ...
1
answers
0
votes
227
views
profile picture
asked 7 months ago
Preamble: We need to extract a unique ID from an image. The image consists of 8 numbers and a character, like so: 99999999C. This code is within a Spanish document in the form of a PNG file, the api ...
1
answers
0
votes
198
views
asked 7 months ago
This question should be in Textract category, but system places it in ML for some unknown reason. I am interfacing with the Textract API using the AWS PHP SDK. I'd like to use the new Textract Layou...
1
answers
0
votes
285
views
asked 7 months ago
I'm using the async workflow and paging through the results of GetDocumentTextDetection and have a couple questions. For context, I'm using these results to determine the "primary" orientation of the...
Accepted AnswerAmazon Textract
1
answers
0
votes
221
views
asked 7 months ago
I use Textract to read tables that have been filled in with handwriting. In general it works great, but there is a recurring issue of Textract not recognizing '1' or interpreting it as a column separa...
1
answers
0
votes
262
views
asked 7 months ago