Nishant N@MarkTechPost
//
Amazon has unveiled Nova Act, a new AI agent designed to interact with web browsers and automate tasks. Released as a research preview, the Nova Act SDK allows developers to create AI agents capable of automating tasks such as filling out forms, navigating web pages, and managing workflows. U.S.-based users can access the SDK through the nova.amazon.com platform.
Nova Act distinguishes itself by focusing on reliability in completing complex, multi-step tasks by breaking down workflows into atomic commands and integrating with tools like Playwright for direct browser manipulation. Developers can enhance functionality further by interleaving Python code. Early benchmarks suggest Nova Act outperforms competitors like OpenAI’s CUA and Anthropic’s Claude 3.7 Sonnet on specific web interaction tasks, demonstrating Amazon’s commitment to advancing agentic AI.
Recommended read:
References :
- Analytics India Magazine: The Nova Act SDK is built to automate workflows by breaking down complex tasks into smaller commands, such as searching, completing checkouts, and answering questions based on on-screen content.
- THE DECODER: Amazon launches AI agent toolkit with Nova Act SDK
- Flipboard Tech Desk: Amazon has unveiled Nova Act, a general-purpose AI agent that can take control of a web browser and independently perform some simple actions like making dinner reservations or filling out online forms. Read more at .
- GeekWire: ‘Nova Act’ moves Amazon further into the AI agent race
- TestingCatalog: Discover Amazon's Nova Act, a new AI model for automating web tasks. Released as a research preview, it excels in reliability and developer control. Try it now!
- WIRED: Amazon's AGI Lab Reveals Its First Work: Advanced AI Agents
- Quartz: Amazon wants its new AI agent to do stuff on the web for you
- AWS Machine Learning Blog: In this post, we explore how CrewAI’s open source agentic framework, combined with Amazon Bedrock, enables the creation of sophisticated multi-agent systems that can transform how businesses operate.
- AI ? SiliconANGLE: Amazon.com Inc. today introduced Nova Act, a new artificial intelligence agent that can take control of web browsers and take independent actions.
- THE DECODER: Nova Act is Amazon's foray into agentic AI that navigates your browser
- www.it-daily.net: Amazon Nova Act: AI agent for browser control presented
- Techzine Global: Amazon is making access to its frontier intelligence models easier with the launch of nova.amazon.com.
- AI News: Amazon Nova Act: A step towards smarter, web-native AI agents
- MarkTechPost: Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks
- AI News | VentureBeat: What you need to know about Amazon Nova Act: the new AI agent SDK challenging OpenAI, Microsoft, Salesforce
Nitika Sharma@Analytics Vidhya
//
China's Manus AI, developed by Monica, is generating buzz as an invite-only multi-agent AI product. This AI agent is designed to autonomously tackle complex, real-world tasks by operating as a multi-agent system. It utilizes a planner optimized for strategic reasoning, and an executor driven by Claude 3.5 Sonnet, incorporating code execution, web browsing, and multi-file code management.
The AI agent has sparked considerable global attention, igniting discussions about its technological and ethical implications, as well as its potential impact on the AI landscape. Manus reportedly outperformed OpenAI's o3-powered Deep Research agent on benchmarks, as showcased on the Manus website, leading some to believe it is among the most effective autonomous agents currently available. However, there is some skepticism due to it appearing to be a Claude wrapper with a jailbreak and tools optimized for the GAIA benchmark.
Recommended read:
References :
- Maginative: Manus AI, China's new autonomous agent, is making waves with its ability to independently analyze, plan, and execute tasks. With industry leaders calling it “the AI agent we were promised,� it's raising the stakes in the global AI race.
- MarkTechPost: In today’s digital era, the way we work is rapidly evolving, yet many challenges persist. Conventional AI assistants and manual workflows struggle to keep pace with the complexity and volume of modern tasks. Professionals and businesses face repetitive manual processes, inefficient research methods, and a lack of true automation. While traditional tools offer suggestions and […] The post appeared first on .
- Fello AI: Manus AI is a newly announced autonomous AI agent developed by the Chinese startup Monica. It has been designed as a general AI agent that goes beyond simple text generation by autonomously planning, executing, and delivering complex tasks. The system is positioned as a breakthrough in AI technology, offering capabilities that mimic a human team working […] The post appeared first on .
- Analytics Vidhya: Ever felt buried under a mountain of tasks, wishing for an extra set of hands to get things done? What if you could offload those tasks and get results without being glued to your screen? Manus – an AI agent from China gaining attention for its ability to handle general tasks with ease. In a […] The post appeared first on .
- The Rundown AI: PLUS: China's Manus demos ‘world’s first fully autonomous’ AI agent
- Craig Smith: Forbes discusses China’s Autonomous Agent, Manus, Changes Everything
- AI News | VentureBeat: What you need to know about Manus, the new AI agentic system from China
- AI Accelerator Institute: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
- thezvi.wordpress.com: The Manus Marketing Madness
- Analytics Vidhya: This article talks about comparison between China's new AI agent 'Manus' and OpenAI 'Operator'
- The Register - Software: Prompts see it scour the web for info and turn it into decent documents at reasonable speed Chinese researchers’ AI prowess is again a hot topic after a startup called Monica.im last week revealed “Manus�, a service it bills as a “general agent� that might improve on tools offered by Western companies.
- AIwire: China’s Manus AI: A Game-Changer or Just Another Overhyped Agent?
- bdtechtalks.com: What is Manus, the AI agent taking on OpenAI Deep Research
- OODAloop: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
- pub.towardsai.net: Discussion on Manus AI's architecture, performance, and potential.
- Tech News | Euronews RSS: A new Chinese AI platform is causing a frenzy. But is it worth the hype? Euronews Next takes a look.
- techxplore.com: What to know about Manus, China's latest AI assistant
- www.laptopmag.com: What is Manus AI? The autonomous assistant that wants to do the work for you
- techstrong.ai: Chinese Startup’s Manus AI Agent Generates Hype, Skepticism
- www.tomsguide.com: Manus AI is the new challenger to DeepSeek — everything you need to know
- Gradient Flow: Manus: What You Need To Know
- hackernoon.com: Founder of China’s New AI Model Says His Agent is More Autonomous Than Rivals'
- iHLS: Introducing Manus: The World’s First Fully Autonomous AI Agent
- TechNode: China’s AI agent Manus gains traction amid growing demand for autonomous AI
@Techmeme
//
OpenAI has unveiled its first AI agent, named Operator, designed to autonomously handle tasks on the web. This innovative tool, currently in a research preview for ChatGPT Pro subscribers in the US, is powered by the Computer-Using Agent (CUA) model. CUA leverages GPT-4o's visual capabilities coupled with advanced reasoning to interact with websites like a human. Operator can navigate pages, input text, click buttons, and scroll to accomplish a variety of tasks, including making dinner reservations, filling out online forms, and ordering groceries.
Operator's functionality is not just limited to basic web interactions, it can also learn from its mistakes, "self-correcting" and giving users back control when needed. The AI agent operates through a dedicated web browser and keeps users updated on its actions through explanations. OpenAI plans to integrate Operator into all of its ChatGPT clients after this initial release, and hopes to make the CUA available for developers through its API. While still in early stages, Operator represents a significant stride towards more user-friendly and practical AI applications.
Recommended read:
References :
- arstechnica.com: OpenAI launches Operator, an AI agent that can operate your computer
- go.theregister.com: OpenAI's Operator agent wants to tackle your online chores – just don’t expect it to nail every task
- Quartz: OpenAI's agent that can do work for you is here
- TechCrunch: OpenAI launches Operator, an AI agent that performs tasks autonomously
- www.technologyreview.com: OpenAI says Operator is powered by Computer-Using Agent, or CUA, which combines GPT-4o's vision capabilities with reasoning abilities of more advanced models (Will Douglas Heaven/MIT Technology Review)
- Latest from TechRadar: OpenAI's first AI Agent is here, and Operator can make a dinner reservation and complete other tasks on the web for you
- Ars OpenForum: OpenAI launches Operator, an AI agent that can operate your computer
- The Register - Software: OpenAI's Operator agent wants to tackle your online chores – just don’t expect it to nail every task
- Dataconomy: Every question about OpenAI Operator—answered
- www.heise.de: With Operator, OpenAI offers an AI agent for almost all activities on the web The operator can search for and order things on the web independently. The AI agent is still a preview and initially reserved for US subscribers to ChatGPT Pro.
- www.theverge.com: OpenAI releases a "research preview" of its Operator AI agent that can automate web-based tasks, launching in the US to subscribers of the $200/month Pro tier (Jay Peters/The Verge)
- techcrunch.com: OpenAI says it may store deleted Operator data for up to 90 days
- www.producthunt.com: ChatGPT Operator
- Techmeme: OpenAI releases a "research preview" of its Operator AI agent that can automate web-based tasks, launching in the US to subscribers of the $200/month Pro tier (Jay Peters/The Verge)
- PCMag Middle East ai: ChatGPT Pro subscribers can ask the tool to plan trips, order groceries, and more.
- Pivot to AI: On Monday, OpenAI launched a “research preview� of Operator, an AI agent that browses the web — “you give it a task and it will execute it� — for anyone paying $200/month for ChatGPT Pro.
- Ars Technica: OpenAI launches Operator, an AI agent that can operate your computer New research "Computer-Use Agent" AI model can jump in and help users with on-screen tasks.
- heise online English: With Operator, OpenAI offers an AI agent for almost all activities on the web The operator can search for and order things on the web independently. The AI agent is still a preview and initially reserved for US subscribers to ChatGPT Pro.
- every.to: Hands-on with Operator: limited in what it can browse, can perform repetitive workflows, and can do lengthy tasks on its own with minimal prompting
- techcrunch.com: OpenAI wants to take over your browser.
Thomas Claburn@The Register
//
Opera has introduced "Browser Operator," a new native AI agent integrated directly into its browser. This AI agent is designed to automate repetitive tasks, enhancing user convenience by performing actions such as purchasing products, completing online forms, and gathering web content. Unlike separate tools like Google AI assistant or ChatGPT, Browser Operator is an extension of the browser itself, processing tasks locally to empower users and streamline their online activities.
Opera's AI agent utilizes natural language processing powered by Opera’s AI Composer Engine to interpret written instructions and execute corresponding tasks within the browser. It allows users to delegate tasks like buying socks, booking flights, or searching the web. Opera emphasized the privacy-focused architecture, claiming that the AI agent is faster and more secure than cloud-based alternatives because it does not take screenshots or capture videos of your screen. The tool is the latest in a long line of AI developments at the Norwegian company, which launched a fully AI-enabled browser in 2023.
Recommended read:
References :
- The Next Web: Thenextweb reports Opera browser unveils AI agent that handles online tasks for you
- AI News: Opera has introduced “Browser Operator,� a native AI agent designed to perform tasks for users directly within the browser. Rather than acting as a separate tool, Browser Operator is an extension of the browser itself—designed to empower users by automating repetitive tasks like purchasing products, completing online forms, and gathering web content. Unlike server-based AI […] The post appeared first on .
- The Register - Software: Phantom of the Opera: AI agent now lurks within browser, for the lazy
- PCWorld: On Monday, browser maker Opera published a seriously impressive demo of what it calls “Browser Operator,� showing off its upcoming AI-powered browser technology that allows you to assign shopping tasks to Opera, which it then pursues independently.
- Towards AI: Opera Unveils AI Browser Operator & Web Automation
- www.windowscentral.com: Microsoft's Windows Recall should've been everything Opera's Browser Operator promises to be on paper — an AI agent with a pause button" to preserve user privacy
- www.computerworld.com: Opera adds ‘Browser Operator,’ an AI agent, to its browser
- pub.towardsai.net: Opera Unveils AI Browser Operator & Web Automation
@Techmeme
//
OpenAI has launched a research preview of its new Operator AI agent, designed to automate web-based tasks. This tool, available initially in the US for subscribers of the $200/month ChatGPT Pro tier, can navigate and interact with webpages using its own browser, performing actions like typing, clicking, and scrolling. Operator aims to handle online chores such as booking travel, making restaurant reservations, and shopping, leveraging a "Computer-Using Agent" model that combines GPT-4o’s vision capabilities with advanced reasoning. OpenAI cautions that Operator may not always work perfectly, particularly with complex interfaces and will prompt the user to take over when sensitive information is needed.
Despite the promise of automation, OpenAI has raised privacy concerns regarding data retention. The company states that it may store deleted Operator data, including chats and associated screenshots, for up to 90 days, even after users manually delete them. This extended retention period, compared to the 30 days for ChatGPT, is intended to allow OpenAI to better understand and review potential abuse of the tool. This policy allows authorized OpenAI personnel and trusted service providers access to the data for fraud monitoring and other legal purposes. Additionally, some users are reporting billing issues related to a vector store of 0kb size, with concerns about whether they'll be charged and how they fit within the free 1GB category.
Recommended read:
References :
- The Register - Software: OpenAI's Operator agent wants to tackle your online chores – just don’t expect it to nail every task
- techcrunch.com: OpenAI says it may store deleted Operator data for up to 90 days
- www.producthunt.com: ChatGPT Operator
- www.theverge.com: OpenAI releases a "research preview" of its Operator AI agent that can automate web-based tasks, launching in the US to subscribers of the $200/month Pro tier
- Latest from TechRadar: OpenAI's first AI Agent is here, and Operator can make a dinner reservation and complete other tasks on the web for you
- Ars OpenForum: OpenAI launches Operator, an AI agent that can operate your computer
- every.to: Hands-on with Operator: limited in what it can browse, can perform repetitive workflows, and can do lengthy tasks on its own with minimal prompting
@the-decoder.com
//
OpenAI has expanded the availability of its AI agent, Operator, to numerous countries including Australia, Brazil, Canada, India, Japan, Singapore, South Korea, and the United Kingdom. This expansion makes Operator available in most locations where ChatGPT is accessible, with the exception of the EU, Switzerland, Norway, Liechtenstein, and Iceland, although efforts are underway to include these regions in the future. Operator, which initially launched in the U.S. in January 2025, is designed to independently operate a web browser to complete tasks for users.
Operator is currently exclusive to ChatGPT Pro subscribers, who pay $200 per month for access. The tool operates through a dedicated web page, with plans to integrate it across all ChatGPT clients in the future. As a browser-use agent, Operator faces competition from entities like Google, Anthropic, and Rabbit, each developing similar agent technologies. Early testing indicates that despite the hype around consumer tasks like ordering pizza, its future may lie in more sophisticated research and task execution, possibly in combination with tools like Deep Research.
Recommended read:
References :
- techcrunch.com: OpenAI rolls out its AI agent, Operator, in several countries
- THE DECODER: OpenAI rolls out Operator to more countries
|
|