Textract multiple answers missing geometry

Question

Hello, I would like to know whether anything has changed in the way Textract returns answers to queries.
For example:
If I ask "What is the title of this doc?" and restrict the query to page 1, I get an answer with text and coordinates.
However, for 'interpreted' answers, e.g. "What are the standards of this doc?" with the same page 1 lookup, the geometry comes back as None:

query is TBlock(geometry=None, id='d1a1bac6-8c00-4b8b-91ef-72ff7d3398d9', block_type='QUERY', relationships=[TRelationship(type='ANSWER', ids=['d3c0611d-a7ba-48ed-9d4a-031e64a3d4f3'])], confidence=None, text=None, column_index=None, column_span=None, entity_types=None, page=1,
row_index=None, row_span=None, selection_status=None, text_type=None, custom=None, query=TQuery(text='what are the standards of the certified weight?', alias='tc_certified_shipping_standards'))

rels is TRelationship(type='ANSWER', ids=['d3c0611d-a7ba-48ed-9d4a-031e64a3d4f3'])
[TBlock(geometry=None, id='d3c0611d-a7ba-48ed-9d4a-031e64a3d4f3', block_type='QUERY_RESULT', relationships=None, confidence=43.0, text='GRS, GRS', column_index=None, column_span=None, entity_types=None, page=1, row_index=None, row_span=None, selection_status=None, text_type=None, custom=None, query=None)]

I have a fairly large chunk of code that depends on coordinates, and for five months straight I had no issues. I also checked by pinning the other Textract-related libraries to their old versions and by testing on old git branches.

So, is this a new way for Textract to answer questions?

Please and thank you!

Answer

On May 15, 2023, the Queries feature of Amazon Textract's AnalyzeDocument API received an update that improved the quality of its machine-learning models [1]. The update reduced latency when using the AnalyzeDocument API with the Queries feature, and it improved data extraction accuracy for 14 new document types.
To take advantage of these improvements, please ensure that you have updated your AWS CLI/SDK to the latest version.
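
Before upgrading, it can help to record which versions are currently installed so that behaviour can be compared before and after. The following is a minimal sketch using only the Python standard library; the package names are assumptions (the `TBlock` objects in the question look like they come from `amazon-textract-response-parser`, but that is a guess), and `installed_version` is a hypothetical helper, not part of any AWS SDK.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Assumed package names; adjust to whatever your project actually depends on.
for name in ("boto3", "botocore", "amazon-textract-response-parser"):
    print(f"{name}: {installed_version(name) or 'not installed'}")
```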

If the issue persists, I suggest opening a case with AWS Premium Support. Their team has access to internal tools that can help identify and resolve the root cause of the issue.
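
In the meantime, code that depends on coordinates could guard against answers that arrive without geometry instead of failing on them. The following is a rough sketch, not an official recommendation: it assumes you have the raw AnalyzeDocument JSON response as a Python dict (with the usual `Blocks`, `BlockType`, `Relationships`, and `Geometry` keys), and `query_answers` is a hypothetical helper name. The sample data mirrors the blocks shown in the question.

```python
from typing import Any, Dict, List


def query_answers(response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Pair each QUERY block with its QUERY_RESULT blocks.

    "bounding_box" is None whenever the answer block carries no Geometry,
    so callers can decide how to handle answers without coordinates.
    """
    blocks = response.get("Blocks", [])
    by_id = {b["Id"]: b for b in blocks}
    results: List[Dict[str, Any]] = []
    for block in blocks:
        if block.get("BlockType") != "QUERY":
            continue
        query = block.get("Query", {})
        for rel in block.get("Relationships") or []:
            if rel.get("Type") != "ANSWER":
                continue
            for answer_id in rel.get("Ids", []):
                answer = by_id.get(answer_id)
                if answer is None:
                    continue
                geometry = answer.get("Geometry") or {}
                results.append({
                    "alias": query.get("Alias"),
                    "question": query.get("Text"),
                    "text": answer.get("Text"),
                    "confidence": answer.get("Confidence"),
                    "page": answer.get("Page"),
                    "bounding_box": geometry.get("BoundingBox"),
                })
    return results


# Sample shaped like the blocks in the question (no Geometry on the answer).
sample = {
    "Blocks": [
        {
            "Id": "d1a1bac6-8c00-4b8b-91ef-72ff7d3398d9",
            "BlockType": "QUERY",
            "Page": 1,
            "Query": {
                "Text": "what are the standards of the certified weight?",
                "Alias": "tc_certified_shipping_standards",
            },
            "Relationships": [
                {"Type": "ANSWER",
                 "Ids": ["d3c0611d-a7ba-48ed-9d4a-031e64a3d4f3"]},
            ],
        },
        {
            "Id": "d3c0611d-a7ba-48ed-9d4a-031e64a3d4f3",
            "BlockType": "QUERY_RESULT",
            "Page": 1,
            "Confidence": 43.0,
            "Text": "GRS, GRS",
        },
    ]
}

answers = query_answers(sample)
for a in answers:
    if a["bounding_box"] is None:
        print(f"{a['alias']}: {a['text']!r} (no geometry returned)")
    else:
        print(f"{a['alias']}: {a['text']!r} at {a['bounding_box']}")
```

Logging the answers that come back without a bounding box, together with the query text, would also give Support concrete examples to work from.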

[1]: https://aws.amazon.com/about-aws/whats-new/2023/05/amazon-textract-updates-queries-analyze-document-api/
