1 Answer
- Newest
- Most votes
- Most comments
0
Using bounding boxes might be helpful. You should try the Textractor Package (amazon-textract-overlayer)
answered 21 days ago
Relevant content
- asked 2 months ago
- asked 2 years ago
- asked 2 years ago
- AWS OFFICIALUpdated 15 days ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 2 years ago
Thanks for your answer. Yes, I have been trying that, using bounding boxes to identify the x-min and y-min of response objects and then trying to devise a way to order them. But, the challenge is even using the x-min coordinate, I am not able to differentiate which response objects fall in Column 1, Column 2, or Column 3 of the page. In the output, I have to first order all the objects of column 1, with an increasing value of y-min, followed by that of column 2, and so on. Is there any way or algorithm you can think of to help me achieve this?