AWS Textract Analyze Document Demo Web interface producing better results than AnalyzeDocument Synchronous API using Boto 3 client.

0

I uploaded a sample document to the AWS Textract demo web interface via our in-house portal, which consumes AWS Textract via the synchronous AnalyzeDocument API, and the table results provided by the web interface are more accurate than the results provided by the API. For example, all the headers within the table were properly extracted within the interface, while some were missing when using the API on the same sample document. The Boto client was updated to the latest version, boto3 1.26.160, botocore 1.29.160 and the region used is us-east-1.

已提问 1 年前285 查看次数
1 回答
0

I noticed that while performing the operation via the console, a StartExpenseAnalysis API call was being made, which is an Asynchronous API call to analyze invoices and receipts. However, you’re making use of Synchronous operation - AnalyzeExpense.

Please allow me to inform you that Asynchronous workflow involves different pre-processing component than synchronous one to process PDFs and Images. Although we try to align them in terms of functionalities and behavior, we do see discrepancy sometimes. Thus, I’d suggest you to use the Asynchronous operations

AWS
已回答 1 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则