News from the AI & ML world
Danilo Poccia@AWS News Blog
Amazon has unveiled Nova Sonic, a new foundation model available on Amazon Bedrock, aimed at revolutionizing voice interactions within generative AI applications. This unified model streamlines the development of speech-enabled applications by integrating speech recognition and generation into a single system. This eliminates the traditional need for multiple fragmented models, reducing complexity and enhancing the naturalness of conversations. Nova Sonic seeks to provide more human-like interactions by understanding contextual nuances, tone, prosody, and speaking style.
Nova Sonic is already incorporated into Alexa+, Amazon’s upgraded voice assistant. Rohit Prasad, Amazon’s head of AI, explained that Nova Sonic is good at deciding when to pull information from the internet or other apps. For example, if you ask about the weather, it checks a weather website. If you want to order groceries, it connects to your shopping list. Because a single model handles both expressive speech generation and real-time text transcription, developers building conversational applications do not need a separate model for either task, and the spoken responses can adapt to the conversation.
The model is designed to recognize when users pause, hesitate, or even interrupt, and to respond fluidly in a way that mimics natural human conversation. Developers can use function calling and agentic workflows to connect Nova Sonic with external services and APIs. The model currently supports American and British English, with plans to add more languages soon. As part of Amazon’s commitment to responsible AI, it also includes built-in protections for content moderation and watermarking. Amazon claims that the new model is 80% cheaper to use than OpenAI’s GPT-4o and also faster.
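The function-calling pattern mentioned above can be illustrated with a minimal sketch. This is not the Amazon Bedrock or Nova Sonic API: the tool names (`get_weather`, `add_to_shopping_list`), the shape of the `call` dictionary, and the `dispatch_tool_call` helper are all hypothetical, chosen only to mirror the weather and grocery examples. The idea is that the model emits a structured request naming a tool and its arguments, and the application routes that request to the matching handler.

```python
from typing import Any, Callable, Dict


# Hypothetical tool handlers. A real application would call an external
# weather service or shopping-list API here instead of returning a string.
def get_weather(city: str) -> str:
    return f"Weather lookup requested for {city}"


def add_to_shopping_list(item: str, quantity: int = 1) -> str:
    return f"Added {quantity} x {item} to the shopping list"


# Registry mapping tool names to handlers (names are assumptions).
TOOLS: Dict[str, Callable[..., str]] = {
    "get_weather": get_weather,
    "add_to_shopping_list": add_to_shopping_list,
}


def dispatch_tool_call(call: Dict[str, Any]) -> str:
    """Route a model-emitted tool request to the matching handler.

    `call` is assumed to look like {"name": ..., "arguments": {...}};
    the actual format produced by any given model may differ.
    """
    name = call.get("name")
    handler = TOOLS.get(name)
    if handler is None:
        return f"Unknown tool: {name}"
    return handler(**call.get("arguments", {}))


if __name__ == "__main__":
    print(dispatch_tool_call({"name": "get_weather",
                              "arguments": {"city": "Seattle"}}))
    print(dispatch_tool_call({"name": "add_to_shopping_list",
                              "arguments": {"item": "milk", "quantity": 2}}))
```

In a voice application, the string returned by the handler would typically be passed back to the model so it can phrase a spoken reply; that round trip is omitted here.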
References:
- THE DECODER: Details how Amazon's new Nova Sonic powers Alexa+.
- thetechbasic.com: Reports on Amazon’s new AI model, Nova Sonic, aiming to make Alexa sound more human.
- AI News | VentureBeat: Reports on Amazon launching new realtime voice model Nova Sonic for third-party enterprise development.
- the-decoder.com: Amazon's new Nova Sonic powers Alexa+
- The Tech Basic: Amazon just introduced Nova Sonic, a new artificial intelligence model that makes talking to computers feel natural. Unlike older voice assistants like Alexa or Siri, Nova Sonic can understand pauses, emotions, and even background noise. It is faster and cheaper than similar models from OpenAI and Google, according to Amazon.
- AWS News Blog: Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications
- Last Week in AI: Amazon unveils Nova Act, an AI agent that can control a web browser
- Analytics India Magazine: Amazon Rolls Out Nova Sonic and Nova Reel 1.1 for Generative Voice and Video AI
- AWS News Blog: Amazon Nova Reel 1.1 enables the generation of multi-shot videos up to 2-minutes in length and style consistency across shots, and quality and latency improvements over Amazon Nova Reel 1.0.
- aithority.com: Introducing Amazon Nova Sonic: A New Gen AI Model for Building Voice Applications and Agents
Classification:
- HashTags: #AmazonAI #NovaSonic #GenerativeAI
- Company: Amazon
- Target: Generative AI developers
- Product: Nova Sonic
- Feature: Voice Conversations
- Type: AI
- Severity: Informative