- Newest
- Most votes
- Most comments
Based on the information provided, I don't see any specific documentation about issues with the digit "7" in Amazon Transcribe. The transcription behavior you're experiencing where "7" is being duplicated in number sequences is not a documented limitation or known issue in the AWS Transcribe service.
Amazon Transcribe does have specific handling for numbers, which varies by language. For languages with number transcription support, Amazon Transcribe converts spoken numbers into their digit form rather than word form. Amazon Transcribe Medical, for example, has specific rules for transcribing digits as numbers instead of words, such as converting cardinal numbers greater than 10 to numbers.
The issue you're experiencing might be related to:
- The specific language model for "en-IN" (English-India) and how it processes certain number pronunciations
- Audio quality or clarity issues when pronouncing the digit "7"
- Potential configuration issues with the streaming transcription parameters
I would recommend:
- Testing with different language codes to see if the issue persists
- Checking if the issue occurs in batch transcription jobs as well as streaming
- Ensuring your audio input has sufficient clarity and appropriate volume levels
- Reviewing the failure reasons that Amazon Transcribe provides if jobs are failing
If the issue continues, you may want to contact AWS Support as this appears to be a specific behavior not documented in the service information.
Sources
Transcribing numbers and punctuation - Amazon Transcribe
Transcribing numbers - Amazon Transcribe
TranscriptionJob - Amazon Transcribe
Relevant content
- asked 2 years ago
- AWS OFFICIALUpdated 5 years ago
- AWS OFFICIALUpdated 10 months ago
