Persistent Text Detection Issue in Documents

In several documents that we have tried to process with Amazon's OCR feature (Amazon Textract), the system fails to detect text in some areas even though it is clearly legible. We have run numerous tests and tried multiple approaches to resolve this issue. We have also compared the results with other OCR services, such as RapidOCR, which do not exhibit this problem.

The measures we have taken so far include:

  • Sending the document in PDF format.
  • Sending a high-resolution image.
  • Resizing the image to improve detection.
  • Adjusting contrast levels to optimize text readability.
  • Cropping the white border so that only the text remains, as suggested in other similar issues.

However, regardless of these efforts, the problem persists, particularly at the top of the documents.
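To make the "top of the document" symptom easier to reproduce and report, the check below can be run against a Textract response. It is only a rough sketch, not an official diagnostic: `lines_in_band`, `detect_text`, the 15% band, and the sample response are hypothetical names and values chosen for illustration. It relies on the response listing `Blocks` with a `BlockType` and a `Geometry.BoundingBox` whose `Top` is a fraction of the page height.

```python
from typing import Any


def lines_in_band(response: dict[str, Any], top: float = 0.0,
                  bottom: float = 0.15) -> list[str]:
    """Return the text of LINE blocks whose top edge lies in [top, bottom).

    `top` and `bottom` are fractions of the page height (0.0 is the top edge).
    The 0.15 default is an arbitrary assumption; adjust it to the header area.
    """
    found = []
    for block in response.get("Blocks", []):
        if block.get("BlockType") != "LINE":
            continue  # skip PAGE and WORD blocks
        box = block["Geometry"]["BoundingBox"]
        if top <= box["Top"] < bottom:
            found.append(block.get("Text", ""))
    return found


def detect_text(path: str) -> dict[str, Any]:
    """Hypothetical helper: send one local file to Textract synchronously.

    Requires boto3 and configured AWS credentials; imported lazily so the
    rest of this sketch runs without them.
    """
    import boto3

    with open(path, "rb") as f:
        data = f.read()
    return boto3.client("textract").detect_document_text(Document={"Bytes": data})


# Hand-made response fragment (not real Textract output) for a dry run.
sample = {
    "Blocks": [
        {"BlockType": "PAGE",
         "Geometry": {"BoundingBox": {"Top": 0.0, "Left": 0.0,
                                      "Width": 1.0, "Height": 1.0}}},
        {"BlockType": "LINE", "Text": "Body paragraph",
         "Geometry": {"BoundingBox": {"Top": 0.40, "Left": 0.1,
                                      "Width": 0.8, "Height": 0.03}}},
    ]
}

if not lines_in_band(sample):
    print("No LINE blocks detected in the top band of the page")
```

An empty result for a page whose header is clearly legible gives a concrete, shareable description of the failure, and the same check can be repeated after each preprocessing attempt to see whether anything changed.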

Attached to this message, you will find a sample of the original document and a screenshot illustrating the text that was not detected correctly.

We appreciate your attention and action on this matter.


Asked a month ago, 127 views
1 Answer

Hi, thank you for using Textract. We are sorry that you are facing issues with detection accuracy. Would you be able to share the document through some medium so we can help further?

AWS
Answered 10 days ago
