2 Answers
1
You can find the Textract limits in the AWS documentation: https://docs.aws.amazon.com/textract/latest/dg/limits.html
For PDF files, the limits are 10 MB and 1 page for synchronous operations, and 500 MB and 3,000 pages for asynchronous operations.
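As a rough illustration of how those limits drive the choice of operation, here is a minimal sketch. The function name `choose_textract_mode` and the constants are hypothetical, and treating "MB" as binary megabytes is an assumption; check the limits page for the exact current values before relying on it.

```python
# Limits as quoted in this answer. Whether "MB" means 10**6 or 2**20 bytes
# is an assumption here; verify against the Textract limits page.
SYNC_MAX_BYTES = 10 * 1024 * 1024
SYNC_MAX_PAGES = 1
ASYNC_MAX_BYTES = 500 * 1024 * 1024
ASYNC_MAX_PAGES = 3000


def choose_textract_mode(size_bytes: int, page_count: int) -> str:
    """Return "sync", "async", or "too_large" for a PDF of the given size."""
    if size_bytes < 0 or page_count < 1:
        raise ValueError("size_bytes must be >= 0 and page_count must be >= 1")
    # A PDF qualifies for the synchronous operations only if it fits both limits.
    if page_count <= SYNC_MAX_PAGES and size_bytes <= SYNC_MAX_BYTES:
        return "sync"
    # Anything larger, or with more than one page, needs the asynchronous path.
    if page_count <= ASYNC_MAX_PAGES and size_bytes <= ASYNC_MAX_BYTES:
        return "async"
    return "too_large"
```

With these assumptions, a 1 MB PDF with two pages maps to "async" even though it is far below the synchronous size limit, because the page count alone rules out the synchronous operations.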
As for MS Word: are you sure you need OCR? You can read the text of .doc and .docx files directly, without using OCR.
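For .docx files, reading the text directly could look something like the sketch below. It uses only the Python standard library and relies on a .docx being a ZIP archive whose main body lives in `word/document.xml`. The helper name `docx_paragraphs` is hypothetical, and this is a simplification: it ignores headers, footers, footnotes, and nested content such as text boxes, and it does not handle the legacy binary .doc format.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by elements in word/document.xml.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_paragraphs(source):
    """Return the text of each paragraph in a .docx file.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for paragraph in root.iter(W_NS + "p"):
        # A paragraph's text is split across runs; join every <w:t> inside it.
        text = "".join(node.text or "" for node in paragraph.iter(W_NS + "t"))
        paragraphs.append(text)
    return paragraphs
```

Calling `docx_paragraphs("report.docx")` (the file name is just an example) returns a list of strings, one per paragraph, with no call to Textract at all.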
Answered 2 years ago
0
Thanks for the quick response. As I understand it, synchronous operations work only for single-page PDFs, and any PDF with more than one page must go through the asynchronous operations regardless of file size. Is that correct? Please advise.
Answered 2 years ago
Correction: It is actually 1 page for synchronous operations, not 10. Some customers benefit from the Tables, Forms, or Queries features even for Word documents, or use the OCR geometry information. You can convert the document to PDF and then run it through Textract.
Thanks, @Martin Schade AWS. Fixed.