News from the AI & ML world
Danilo Poccia@AWS News Blog
//
Amazon has unveiled Nova Sonic, a new foundation model available on Amazon Bedrock, aimed at revolutionizing voice interactions within generative AI applications. This unified model streamlines the development of speech-enabled applications by integrating speech recognition and generation into a single system. This eliminates the traditional need for multiple fragmented models, reducing complexity and enhancing the naturalness of conversations. Nova Sonic seeks to provide more human-like interactions by understanding contextual nuances, tone, prosody, and speaking style.
Amazon Nova Sonic powers Alexa+ and is already incorporated into Alexa+, Amazon’s upgraded voice assistant. Rohit Prasad, Amazon’s head of AI, explained that Nova Sonic is good at deciding when to pull information from the internet or other apps. For example, if you ask about the weather, it checks a weather website. If you want to order groceries, it connects to your shopping list. This integrated approach reduces complexity when building conversational applications and delivers expressive speech generation and real-time text transcription without requiring a separate model, resulting in adaptive speech responses.
The model is designed to recognize when users pause, hesitate, or even interrupt, responding fluidly to mimic natural human conversation. Developers can leverage function calling and agentic workflows to connect Nova Sonic with external services and APIs. The model currently supports American and British English, with plans to add more languages soon. This commitment to responsible AI also includes built-in protections for content moderation and watermarking. Amazon claims that the new model is 80% cheaper to use than OpenAI’s GPT-4o and also faster.
ImgSrc: d2908q01vomqb2.
References :
- AWS News Blog: Amazon Nova Sonic is a new foundation model on Amazon Bedrock that streamlines speech-enabled applications by offering unified speech recognition and generation capabilities, enabling natural conversations with contextual understanding while eliminating the need for multiple fragmented models.
- AI News | VentureBeat: Amazon is best known as an e-commerce giant and then somewhere perhaps slightly further down the list of notable offerings is its Alexa AI voice assistant product, which just got a big intelligence upgrade last month thanks in part to Amazon Nova and Amazon’s investment Anthropic. Now Alexa will
- The Tech Basic: Amazon is best known as an e-commerce giant and then somewhere perhaps slightly further down the list of notable offerings is its Alexa AI voice assistant product, which just got a big intelligence upgrade last month thanks in part to Amazon Nova and Amazon’s investment Anthropic. Now Alexa will
Classification:
- HashTags: #AI #Amazon #NovaSonic
- Company: Amazon
- Target: Developers
- Product: Nova Sonic
- Feature: Voice Conversation
- Type: AI
- Severity: Informative