News from the AI & ML world

DeeperML - #novaact

mike@marketingaiinstitute.com //
References: AWS News Blog, Bernard Marr
Amazon is aggressively pursuing advancements in artificial intelligence, marking a significant push into the AI agent arena. The company has unveiled Nova Act, an AI system designed to control web browsers and autonomously perform tasks such as booking reservations, ordering food, and filling out forms. This new AI has the potential to streamline and automate various online activities, reducing the need for human intervention. The integration of Nova Act into the upcoming Alexa+ upgrade could put this powerful AI agent into the hands of millions of users worldwide.

Amazon is also introducing Nova Sonic, a new foundation model aimed at creating human-like voice conversations for generative AI applications. Nova Sonic unifies speech recognition and generation into a single model. It enables developers to create natural, conversational AI experiences. This integrated approach streamlines development and reduces complexity when building voice-enabled applications. The model delivers expressive speech generation and real-time text transcription without requiring a separate model.

These advancements reflect Amazon's commitment to investing in AI for future growth. CEO Andy Jassy highlighted the importance of aggressive AI investments in a recent shareholder letter, noting plans to spend over $100 billion on capital expenditure in 2025. He described AI as a "once-in-a-lifetime reinvention of everything we know". The move towards agentic AI, as demonstrated by Nova Act and Nova Sonic, is expected to revolutionize various aspects of customer experiences and workplace productivity.

References:
  • AWS News Blog: Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications
  • Bernard Marr: Amazon's new agentic AI system, "Nova Act" is set to transform how we interact with technology at home, potentially outperforming competitors from OpenAI and Anthropic.
  • www.techrepublic.com: The Nova Sonic voice AI model can respond to the speaker’s words, as well as their tone, inflection, and pacing.

Allison Siu@NVIDIA Blog //
Amazon has recently introduced two significant advancements in the realm of artificial intelligence: Nova Act, an AI model designed for browser-based task automation, and a testing phase for the ‘Buy for Me’ feature in its mobile shopping application. Nova Act, currently available as a research preview, prioritizes the reliable execution of simple commands over complex workflows. Amazon aims to unlock the potential of truly autonomous and capable AI agents. The Nova Act SDK allows developers to experiment with the model's capabilities, enabling agents to complete tasks such as submitting out-of-office requests and configuring automatic replies.

The company stresses that genuine AI agents should not primarily focus on conversation or knowledge retrieval, differentiating them from current AI-powered assistants. According to Amazon, Nova Act is designed to complete tasks and act in digital and physical environments on behalf of the user. The potential applications extend to complex, multi-step workflows, such as organizing a wedding or handling complex IT tasks. To get there, Amazon has designed Nova Act to prioritize reliability on simpler, low-level actions that, according to the company, trip up rival models more often, such as picking dates or navigating drop-downs and pop-ups.

Simultaneously, Amazon is testing the ‘Buy for Me’ feature, which integrates AI agents into the mobile shopping app to facilitate purchases from third-party brand websites, even for products not directly sold by Amazon. This feature, in limited beta for select iOS and Android users in the U.S., allows users to authorize Amazon to complete transactions on external brand sites, utilizing Amazon’s Nova AI, along with Anthropic’s Claude via Bedrock, to securely handle payment and shipping details. The brand handles fulfillment, customer service, and returns, while customers can track their purchases within the Amazon app. The feature is an example of a narrowly scoped, highly specialized AI agent doing something useful.

References:
  • Data Phoenix: Amazon unveiled Nova Act, an AI model for browser-based task completion. Available as a research preview, Nova Act prioritizes reliability in executing simple commands rather than higher-level workflows as the key to unlock genuine AI agents that are both capable and autonomous.
  • www.producthunt.com: AI Agent that shops in other sites
  • shellypalmer.com: Amazon is testing a new feature in its mobile shopping app that lets users buy products Amazon doesn’t sell—without leaving the app.

Nishant N@MarkTechPost //
References: AI - SiliconANGLE, AI News
Amazon has unveiled Nova Act, a new AI model designed to automate web browser tasks and build AI agents. This research preview, from the Amazon AGI San Francisco Lab, allows AI to take control of web browsers and perform independent actions. The goal is to create agents capable of performing tangible, multi-step tasks in diverse digital and physical environments, such as organizing a wedding or handling complex IT tasks. Amazon envisions agents not just as responders but as entities capable of performing these tasks to increase business productivity.

To help facilitate the development of these agents, Amazon is releasing a research preview of the Amazon Nova Act SDK. The SDK enables developers to create agents capable of automating web tasks like submitting out-of-office notifications, scheduling calendar holds, or enabling automatic email replies. It breaks down complex workflows into dependable "atomic commands," such as searching, checking out, or interacting with specific interface elements. This SDK supports browser manipulation via Playwright, API calls, Python integrations, and parallel threading to overcome web page load delays, further enhancing accuracy and control.
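As a rough illustration of the "atomic commands" and parallel-threading ideas described above, here is a minimal Python sketch. It does not use the Nova Act SDK: `StubAgent`, `run_workflow`, `run_in_parallel`, and the example.com URLs are hypothetical stand-ins invented for this example, and the stub only records commands instead of driving a browser.

```python
from concurrent.futures import ThreadPoolExecutor


class StubAgent:
    """Hypothetical stand-in for a browser agent; this is not the Nova Act SDK."""

    def __init__(self, start_page):
        self.start_page = start_page
        self.log = []

    def act(self, command):
        # A real agent would drive a browser here; the stub only records the step.
        self.log.append(command)
        return f"done: {command}"


def run_workflow(start_page, commands):
    """Run one workflow as a sequence of small, atomic commands."""
    agent = StubAgent(start_page)
    return [agent.act(command) for command in commands]


def run_in_parallel(workflows, max_workers=4):
    """Run independent workflows on separate threads so one slow page does not block the rest."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {
            name: pool.submit(run_workflow, page, commands)
            for name, (page, commands) in workflows.items()
        }
        return {name: future.result() for name, future in futures.items()}


# Hypothetical workflows, each broken into atomic commands.
workflows = {
    "out_of_office": (
        "https://example.com/hr",
        ["open the time-off form", "pick the start date", "submit the form"],
    ),
    "auto_reply": (
        "https://example.com/mail",
        ["open settings", "enable automatic replies"],
    ),
}
results = run_in_parallel(workflows)
```

Each workflow gets its own agent instance, so the threads share no state; that is a design choice of this sketch, not a documented property of the SDK.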

References:
  • AI - SiliconANGLE: Amazon.com Inc. today introduced Nova Act, a new artificial intelligence agent that can take control of web browsers and take independent actions. The new AI agent is a research preview built by Amazon’s newly opened Amazon AGI San Francisco Lab, which was behind the release of the Amazon Nova foundation models in December.
  • AI News: Amazon has introduced Nova Act, an advanced AI model engineered for smarter agents that can execute tasks within web browsers. While large language models popularised the concept of “agents” as tools that answer queries or retrieve information via methods such as Retrieval-Augmented Generation (RAG), Amazon envisions something more robust.
  • TestingCatalog: Discover Amazon's Nova Act, a new AI model for automating web tasks. Released as a research preview, it excels in reliability and developer control. Try it now!

Nishant N@MarkTechPost //
Amazon has unveiled Nova Act, a new AI agent designed to interact with web browsers and automate tasks. Released as a research preview, the Nova Act SDK allows developers to create AI agents capable of automating tasks such as filling out forms, navigating web pages, and managing workflows. U.S.-based users can access the SDK through the nova.amazon.com platform.

Nova Act distinguishes itself by focusing on reliability in completing complex, multi-step tasks by breaking down workflows into atomic commands and integrating with tools like Playwright for direct browser manipulation. Developers can enhance functionality further by interleaving Python code. Early benchmarks suggest Nova Act outperforms competitors like OpenAI’s CUA and Anthropic’s Claude 3.7 Sonnet on specific web interaction tasks, demonstrating Amazon’s commitment to advancing agentic AI.
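To make "interleaving Python code" concrete, the sketch below shows ordinary Python logic choosing which atomic commands to issue next. It is an assumption-laden illustration rather than the SDK's API: `choose_commands`, the product names, and the prices are all hypothetical, and the values an agent would normally read from a page are hard-coded.

```python
def choose_commands(prices, budget):
    """Plain Python decides which atomic commands to issue next."""
    affordable = {name: price for name, price in prices.items() if price <= budget}
    if not affordable:
        return ["close the tab"]
    cheapest = min(affordable, key=affordable.get)
    return [f"add {cheapest} to the cart", "proceed to checkout"]


# In a real run these values would come from an agent step that reads the page;
# they are hard-coded here so the sketch stays self-contained.
observed_prices = {"kettle A": 42.0, "kettle B": 35.5, "kettle C": 61.0}
next_commands = choose_commands(observed_prices, budget=40.0)
```

Keeping the decision in regular code means the agent only has to execute short, well-defined steps, while branching and arithmetic stay deterministic.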

References:
  • Analytics India Magazine: The Nova Act SDK is built to automate workflows by breaking down complex tasks into smaller commands, such as searching, completing checkouts, and answering questions based on on-screen content.
  • THE DECODER: Amazon launches AI agent toolkit with Nova Act SDK
  • Flipboard Tech Desk: Amazon has unveiled Nova Act, a general-purpose AI agent that can take control of a web browser and independently perform some simple actions like making dinner reservations or filling out online forms.
  • GeekWire: ‘Nova Act’ moves Amazon further into the AI agent race
  • TestingCatalog: Discover Amazon's Nova Act, a new AI model for automating web tasks. Released as a research preview, it excels in reliability and developer control. Try it now!
  • WIRED: Amazon's AGI Lab Reveals Its First Work: Advanced AI Agents
  • Quartz: Amazon wants its new AI agent to do stuff on the web for you
  • AWS Machine Learning Blog: In this post, we explore how CrewAI’s open source agentic framework, combined with Amazon Bedrock, enables the creation of sophisticated multi-agent systems that can transform how businesses operate.
  • AI - SiliconANGLE: Amazon.com Inc. today introduced Nova Act, a new artificial intelligence agent that can take control of web browsers and take independent actions.
  • THE DECODER: Nova Act is Amazon's foray into agentic AI that navigates your browser
  • www.it-daily.net: Amazon Nova Act: AI agent for browser control presented
  • Techzine Global: Amazon is making access to its frontier intelligence models easier with the launch of nova.amazon.com.
  • AI News: Amazon Nova Act: A step towards smarter, web-native AI agents
  • MarkTechPost: Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks
  • AI News | VentureBeat: What you need to know about Amazon Nova Act: the new AI agent SDK challenging OpenAI, Microsoft, Salesforce
  • www.infoq.com: Amazon has announced an expansion of its generative AI capabilities with the introduction of nova.amazon.com, a platform designed to give developers easier access to its foundation models. This includes the newly unveiled Amazon Nova Act, an AI model specifically trained to execute actions within web browsers. By Robert Krzaczyński
  • Data Phoenix: Amazon's Nova Act joins OpenAI and Anthropic's computer using AI agents