- Newest
- Most votes
- Most comments
The problem appears to be in log-2 (not log-1 as labeled in your question), where the customer says "um second of january twenty twenty six" but the assistant responds with "I'd be happy to find an available time. Which date would work best for you?" This indicates that Nova Sonic is not properly understanding the spoken date information.
Amazon Nova Sonic is designed for speech-to-speech conversational tasks, but it has limitations in how it processes certain types of information in spoken form. When dealing with dates, numbers, and complex information, the speech model may struggle more than text-based models because:
-
Speech comprehension differs from text comprehension - information that's clear in text may need to be reformulated for speech contexts.
-
Memory constraints in spoken interactions are different from text - spoken dates and numbers can be particularly challenging to process accurately.
For better results with Nova Sonic when handling dates and appointments, the system prompt should be optimized for speech interactions. For example, it should be designed to:
- Request one piece of information at a time
- Confirm understanding by repeating back information
- Break down complex information (like dates) into smaller chunks
- Use natural conversational flow rather than formal structures
The model is likely struggling with the spoken date format, which is a common challenge in speech-to-speech AI systems.
Sources
Amazon Nova Sonic - AWS AI Service Cards
Amazon Nova Sonic prompting best practices - Amazon Nova
Best practices for the Amazon Nova Sonic system prompt - Amazon Nova
Relevant content
- asked 7 months ago
- AWS OFFICIALUpdated 3 years ago

Okay, thank you for your comment. You mentioned that the Nova Sonic model has some limitations. That’s fine; we can use the Nova 2 Sonic model instead. It is the latest model released by AWS on December 2, 2025.
Would this model be better trained than the Nova Sonic model?