All Content tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

352 results
Hello, We are running a service in region eu-west-1 with a synchronous operations quota of 5RPS. We use synchronous operation: https://docs.aws.amazon.com/textract/latest/dg/API_AnalyzeDocument.html w...
1 answer, 0 votes, 28 views, asked 5 days ago
Hi All, Using Textract service, I am submitting about 11 images for OCR asynchronously by invoking textract.start_document_text_detection. I am running into a problem where all the jobs finish in as ...
1 answer, 0 votes, 18 views, asked 8 days ago
Hi, without any warning and in the middle of processing documents, Textract just stopped working. All technical issues have been ruled out. Most likely related to going over the free tier, but then billing...
1 answer, 0 votes, 23 views, asked 13 days ago
Hi, I'm having issues with the OCR text-extract service recognizing specific acronyms that are perfectly readable in a PDF. The problem is with the abbreviation MVA, which is omitted when I run the OC...
1 answer, 0 votes, 27 views, asked 23 days ago
I tried Amazon Textract to analyze a single-page PDF document via the console and Lambda. The document has multiple bullet points of text sentences. I phrased exactly as the word appears in the docume...
1 answer, 0 votes, 49 views, asked a month ago
A single-page PDF Textract job that used to take about 10 seconds to complete is now taking about 3 minutes.
1 answer, 0 votes, 35 views, asked a month ago
Which is better: performing the OCR and querying only with AWS Textract, or separating it into two steps, with only OCR in Textract and another model for understanding? I find myself in need of extractin...
2 answers, 0 votes, 78 views, asked 2 months ago
We are looking to build a solution that extracts data from a PDF file and turns it into more structured data that we can use in our application. We tried the DocumentAnalysis from Textract which rea...
2 answers, 0 votes, 55 views, asked 2 months ago
https://docs.aws.amazon.com/textract/latest/dg/limits-document.html On the above document, under "File Size and Page Count Limits", it states there are different quotas for "Document" Size versus "Fi...
1 answer, 0 votes, 42 views, asked 2 months ago
Textract is not detecting BlockType 'QUERY' or 'QUERY_RESULT' in some PDF files. I have uploaded them to the AWS Textract environment on the web page and am getting the output for the query question. But same ...
2 answers, 1 vote, 76 views, asked 2 months ago
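For readers checking whether a response actually contains query blocks, here is a minimal sketch that pairs each QUERY block with its QUERY_RESULT blocks. It assumes the block list has the shape returned by AnalyzeDocument when the QUERIES feature type and a QueriesConfig were passed in the request. The function name `query_answers` and the sample data are hypothetical and only illustrate the idea; the question above is truncated, so this may not match the asker's exact setup.

```python
from typing import Any


def query_answers(blocks: list[dict[str, Any]]) -> dict[str, list[str]]:
    """Map each QUERY block's question text to the text of its answers.

    A question that maps to an empty list had a QUERY block but no
    linked QUERY_RESULT block in the response.
    """
    by_id = {block["Id"]: block for block in blocks}
    answers: dict[str, list[str]] = {}
    for block in blocks:
        if block.get("BlockType") != "QUERY":
            continue
        question = block.get("Query", {}).get("Text", "")
        found = answers.setdefault(question, [])
        # QUERY blocks point at their results through ANSWER relationships.
        for rel in block.get("Relationships", []):
            if rel.get("Type") != "ANSWER":
                continue
            for block_id in rel.get("Ids", []):
                result = by_id.get(block_id)
                if result and result.get("BlockType") == "QUERY_RESULT":
                    found.append(result.get("Text", ""))
    return answers


# Made-up sample data in the assumed response shape (not real output).
sample_blocks = [
    {"Id": "q1", "BlockType": "QUERY",
     "Query": {"Text": "What is the invoice total?"},
     "Relationships": [{"Type": "ANSWER", "Ids": ["r1"]}]},
    {"Id": "r1", "BlockType": "QUERY_RESULT", "Text": "120.50"},
    {"Id": "q2", "BlockType": "QUERY",
     "Query": {"Text": "Who is the vendor?"}},
]

print(query_answers(sample_blocks))
```

If the returned dictionary is empty, the response contained no QUERY blocks at all, which usually points at the request (feature types or query configuration) rather than at the parsing code.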
Hi, I need help with getting the Python code for extracting section_headers from a multi-page PDF.
2 answers, 0 votes, 131 views, asked 2 months ago
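One possible starting point, as a sketch only: it assumes the document was analyzed with the LAYOUT feature type, so that the response contains LAYOUT_SECTION_HEADER blocks whose CHILD relationships point at LINE blocks. For a multi-page PDF the blocks would typically come from the asynchronous StartDocumentAnalysis and GetDocumentAnalysis operations, collected across all result pages. The function name `section_headers` and the sample data are hypothetical.

```python
from typing import Any


def section_headers(blocks: list[dict[str, Any]]) -> list[tuple[int, str]]:
    """Return (page, header text) pairs for LAYOUT_SECTION_HEADER blocks."""
    by_id = {block["Id"]: block for block in blocks}
    headers: list[tuple[int, str]] = []
    for block in blocks:
        if block.get("BlockType") != "LAYOUT_SECTION_HEADER":
            continue
        line_texts = []
        # The header text is taken from the LINE blocks the layout block contains.
        for rel in block.get("Relationships", []):
            if rel.get("Type") != "CHILD":
                continue
            for child_id in rel.get("Ids", []):
                child = by_id.get(child_id, {})
                if child.get("BlockType") == "LINE":
                    line_texts.append(child.get("Text", ""))
        # Assumption: default to page 1 when the block carries no Page field.
        headers.append((block.get("Page", 1), " ".join(line_texts)))
    return headers


# Made-up sample data in the assumed response shape (not real output).
sample_layout_blocks = [
    {"Id": "l1", "BlockType": "LINE", "Text": "1. Introduction", "Page": 1},
    {"Id": "l2", "BlockType": "LINE", "Text": "Body text.", "Page": 1},
    {"Id": "h1", "BlockType": "LAYOUT_SECTION_HEADER", "Page": 1,
     "Relationships": [{"Type": "CHILD", "Ids": ["l1"]}]},
    {"Id": "l3", "BlockType": "LINE", "Text": "2. Methods", "Page": 2},
    {"Id": "h2", "BlockType": "LAYOUT_SECTION_HEADER", "Page": 2,
     "Relationships": [{"Type": "CHILD", "Ids": ["l3"]}]},
]

for page, text in section_headers(sample_layout_blocks):
    print(page, text)
```

The function works on plain dictionaries, so it can be tried on a saved JSON response before wiring it to any AWS call.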
I am trying to build a document comparison application as part of my work. The documents I work with contain a lot of text and tabular data. They are PDF files with around 100 pages. I want the a...
1 answer, 0 votes, 175 views, asked 3 months ago