1 Answer
- Newest
- Most votes
- Most comments
0
Other than a clean audio recording, I'd optimize the following factors:
- When using custom vocabularies: keep the list small, and provide IPA pronunciations if you can.
- When using real-time streams: two to five speakers seem best.
- When using an audio source: set the "Maximum number of speaker" to the actual number of speakers in the file.
For sources and more information:
https://aws.amazon.com/transcribe/faqs/
https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html
answered 2 years ago
Relevant content
- asked 2 years ago
- asked a year ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 3 months ago
- AWS OFFICIALUpdated 5 months ago