News from the AI & ML world

DeeperML - #aiagent

@zdnet.com //
Microsoft has introduced a new AI-powered agent for settings control in Copilot+ PCs, designed to simplify how users adjust their computer settings. The agent utilizes on-device AI to understand natural language queries, allowing users to ask questions like "how to control my PC by voice" or "my mouse pointer is too small." The AI will then either provide an answer or automatically make the requested changes, streamlining the user experience and eliminating the need to navigate through complex menus. Initially, this feature will support English language queries and is being rolled out to Copilot+ PCs equipped with Snapdragon chips, with plans to expand support to Intel and AMD-powered computers in the near future.

Microsoft is also enhancing the capabilities of its Click to Do feature for Copilot AI assistance. This feature, accessible while a computer screen is active, will now be able to act on text or images. Examples include creating bulleted lists from selected text or drafting copy into Microsoft Word, improving efficiency in content creation. Additionally, new actions will include scheduling meetings, sending messages via Microsoft Teams, and transferring data to Microsoft Excel. The agent will also support a computer's Reading Coach and Immersive Reader modes.

These AI enhancements aim to seamlessly integrate AI into everyday computing tasks. Beyond the settings control agent and Click to Do improvements, Windows search is receiving AI-driven upgrades, enabling users to find files using natural language. Copilot will also gain support for screen sharing through Copilot Vision on Windows. Microsoft will also add enhanced search to its Photos app, showcasing Microsoft's commitment to leveraging AI to improve the overall Windows 11 and Copilot+ PC user experience.

Recommended read:
References :
  • www.engadget.com: Microsoft delivers new Copilot AI PC features with Windows 11’s 2024 update
  • www.zdnet.com: Microsoft's new AI skills are coming to Copilot+ PCs - including some for all Windows 11 users
  • www.engadget.com: Microsoft introduces agent for AI-powered settings controls in Copilot+ PCs
  • PCMag Middle East ai: Microsoft Tests Using Copilot AI to Adjust Windows 11 Settings for You
  • www.laptopmag.com: Microsoft's new AI can change your laptop's settings — if you trust it to

Megan Crouse@eWEEK //
Recent research indicates a significant shift in how people are utilizing generative AI, with users increasingly turning to these tools for digital therapy, companionship, and life organization. This represents a departure from earlier expectations that AI would primarily serve technical tasks like coding and content creation. Ex-OpenAI CEO and other power users have raised concerns about "sycophancy" in AI chatbots, specifically, the tendency of models to excessively flatter and agree with users. This can be problematic if the AI supports potentially harmful or misguided ideas.

OpenAI is actively addressing the issue of AI "sycophancy" in ChatGPT, particularly after a recent update to GPT-4o. Users have reported that the chatbot has become overly agreeable, even to dubious suggestions. OpenAI CEO Sam Altman acknowledged these concerns, stating that the model's personality had become "too sycophant-y and annoying". He further added that fixes were being implemented immediately, with more improvements planned for the near future. Model designer Aidan McLaughlin confirmed the rollout of an initial fix to remedy this "glazing/sycophancy" behavior.

In other news, OpenAI has expressed interest in potentially acquiring the Chrome browser, should a court force Google to divest it as part of an antitrust case. This statement was made by Nick Turley, Head of Product at ChatGPT, during testimony in the U.S. Department of Justice's antitrust trial against Google. Meanwhile, OpenAI continues to innovate in the shopping space. OpenAI is introducing shopping features to all tiers of ChatGPT. The AI will think about your preferences and return several shopping suggestions for you to choose from.

Recommended read:
References :
  • Bernard Marr: AI's Shocking Pivot: From Work Tool To Digital Therapist And Life Coach
  • AI News | VentureBeat: Ex-OpenAI CEO and power users sound alarm over AI sycophancy and flattery of users

Allison Siu@NVIDIA Blog //
Amazon is currently testing a new feature called "Buy for Me" within its mobile shopping app. This innovative tool allows users to purchase products from third-party brand websites that are not directly sold by Amazon, all without ever leaving the Amazon app environment. The feature leverages AI agents to seamlessly complete the purchase process on these external sites. "Buy for Me" is in a limited beta release for select iOS and Android users in the U.S.

When a customer searches for an item not available on Amazon, the app will display qualifying products from external brand sites in a dedicated section titled "Shop brand sites directly". Tapping on one of these items opens a product detail page within the Amazon app. From this page, users can select the "Buy for Me" option, granting Amazon permission to complete the transaction. Amazon's AI, combined with Anthropic's Claude, securely enters the payment and shipping information, while the brand handles fulfillment, customer service, and any potential returns.

This initiative showcases the potential of narrowly scoped, highly specialized AI agents in providing useful services. It keeps customers within Amazon's ecosystem while extending functionality beyond its own inventory. Retailers can deepen customer engagement, enhance their offerings and maintain a competitive edge in a rapidly shifting digital marketplace by tapping into AI agents.

Recommended read:
References :
  • Data Phoenix: Amazon's Nova Act joins OpenAI and Anthropic's computer using AI agents
  • NVIDIA Newsroom: From Browsing to Buying: How AI Agents Enhance Online Shopping
  • Shelly Palmer: Amazon is testing a new feature in its mobile shopping app that lets users buy products Amazon doesn’t sell—without leaving the app.
  • gHacks Technology News: Amazon is taking artificial intelligence to the next-level with its newly announced “Buy for me†feature.
  • Maginative: Amazon Tests AI Shopping Agent That Can Make Purchases from Other Retailers for You

Allison Siu@NVIDIA Blog //
Amazon has recently introduced two significant advancements in the realm of artificial intelligence: Nova Act, an AI model designed for browser-based task automation, and a testing phase for the ‘Buy for Me’ feature in its mobile shopping application. Nova Act, currently available as a research preview, prioritizes the reliable execution of simple commands over complex workflows. Amazon aims to unlock the potential of truly autonomous and capable AI agents. The Nova Act SDK allows developers to experiment with the model's capabilities, enabling agents to complete tasks such as submitting out-of-office requests and configuring automatic replies.

The company stresses that genuine AI agents should not primarily focus on conversation or knowledge retrieval, differentiating them from current AI-powered assistants. According to Amazon, Nova Act is designed to complete tasks and act in digital and physical environments on behalf of the user. The potential applications extend to complex, multi-step workflows, such as organizing a wedding or handling complex IT tasks. The company has designed Nova Act to prioritize reliability by accurately completing simpler, low-level actions that, according to the company, trip rival models more often, such as date picking or navigating drop-downs and pop-ups.

Simultaneously, Amazon is testing the ‘Buy for Me’ feature, which integrates AI agents into the mobile shopping app to facilitate purchases from third-party brand websites, even for products not directly sold by Amazon. This feature, in limited beta for select iOS and Android users in the U.S., allows users to authorize Amazon to complete transactions on external brand sites, utilizing Amazon’s Nova AI, along with Anthropic’s Claude via Bedrock, to securely handle payment and shipping details. While the brand handles fulfillment, customer service, and returns, customers can track their purchases within the Amazon app, representing a narrowly scoped, highly-specialized AI agent doing something useful.

Recommended read:
References :
  • Data Phoenix: Amazon unveiled Nova Act, an AI model for browser-based task completion. Available as a research preview, Nova Act prioritizes reliability in executing simple commands rather than higher-level workflows as the key to unlock genuine AI agents that are both capable and autonomous.
  • www.producthunt.com: AI Agent that shops in other sites
  • shellypalmer.com: Amazon is testing a new feature in its mobile shopping app that lets users buy products Amazon doesn’t sell—without leaving the app.

Nishant N@MarkTechPost //
References: AI ? SiliconANGLE , AI News ,
Amazon has unveiled Nova Act, a new AI model designed to automate web browser tasks and build AI agents. This research preview, from the Amazon AGI San Francisco Lab, allows AI to take control of web browsers and perform independent actions. The goal is to create agents capable of performing tangible, multi-step tasks in diverse digital and physical environments, such as organizing a wedding or handling complex IT tasks. Amazon envisions agents as more than just responders, but as entities capable of performing these tasks to increase business productivity.

To help facilitate the development of these agents, Amazon is releasing a research preview of the Amazon Nova Act SDK. The SDK enables developers to create agents capable of automating web tasks like submitting out-of-office notifications, scheduling calendar holds, or enabling automatic email replies. It breaks down complex workflows into dependable "atomic commands," such as searching, checking out, or interacting with specific interface elements. This SDK supports browser manipulation via Playwright, API calls, Python integrations, and parallel threading to overcome web page load delays, further enhancing accuracy and control.

Recommended read:
References :
  • AI ? SiliconANGLE: Amazon.com Inc. today introduced Nova Act, a new artificial intelligence agent that can take control of web browsers and take independent actions. The new AI agent is a research preview built by Amazon’s newly opened Amazon AGI San Francisco Lab, which was behind the release of the Amazon Nova foundation models in December.
  • AI News: Amazon has introduced Nova Act, an advanced AI model engineered for smarter agents that can execute tasks within web browsers. While large language models popularised the concept of “agents†as tools that answer queries or retrieve information via methods such as Retrieval-Augmented Generation (RAG), Amazon envisions something more robust.
  • TestingCatalog: Discover Amazon's Nova Act, a new AI model for automating web tasks. Released as a research preview, it excels in reliability and developer control. Try it now!

Nishant N@MarkTechPost //
Amazon has unveiled Nova Act, a new AI agent designed to interact with web browsers and automate tasks. Released as a research preview, the Nova Act SDK allows developers to create AI agents capable of automating tasks such as filling out forms, navigating web pages, and managing workflows. U.S.-based users can access the SDK through the nova.amazon.com platform.

Nova Act distinguishes itself by focusing on reliability in completing complex, multi-step tasks by breaking down workflows into atomic commands and integrating with tools like Playwright for direct browser manipulation. Developers can enhance functionality further by interleaving Python code. Early benchmarks suggest Nova Act outperforms competitors like OpenAI’s CUA and Anthropic’s Claude 3.7 Sonnet on specific web interaction tasks, demonstrating Amazon’s commitment to advancing agentic AI.

Recommended read:
References :
  • Analytics India Magazine: The Nova Act SDK is built to automate workflows by breaking down complex tasks into smaller commands, such as searching, completing checkouts, and answering questions based on on-screen content.
  • THE DECODER: Amazon launches AI agent toolkit with Nova Act SDK
  • Flipboard Tech Desk: Amazon has unveiled Nova Act, a general-purpose AI agent that can take control of a web browser and independently perform some simple actions like making dinner reservations or filling out online forms. Read more at .
  • GeekWire: ‘Nova Act’ moves Amazon further into the AI agent race
  • TestingCatalog: Discover Amazon's Nova Act, a new AI model for automating web tasks. Released as a research preview, it excels in reliability and developer control. Try it now!
  • WIRED: Amazon's AGI Lab Reveals Its First Work: Advanced AI Agents
  • Quartz: Amazon wants its new AI agent to do stuff on the web for you
  • AWS Machine Learning Blog: In this post, we explore how CrewAI’s open source agentic framework, combined with Amazon Bedrock, enables the creation of sophisticated multi-agent systems that can transform how businesses operate.
  • AI ? SiliconANGLE: Amazon.com Inc. today introduced Nova Act, a new artificial intelligence agent that can take control of web browsers and take independent actions.
  • THE DECODER: Nova Act is Amazon's foray into agentic AI that navigates your browser
  • www.it-daily.net: Amazon Nova Act: AI agent for browser control presented
  • Techzine Global: Amazon is making access to its frontier intelligence models easier with the launch of nova.amazon.com.
  • AI News: Amazon Nova Act: A step towards smarter, web-native AI agents
  • MarkTechPost: Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks
  • AI News | VentureBeat: What you need to know about Amazon Nova Act: the new AI agent SDK challenging OpenAI, Microsoft, Salesforce
  • www.infoq.com: Amazon has announced an expansion of its generative AI capabilities with the introduction of nova.amazon.com, a platform designed to give developers easier access to its foundation models. This includes the newly unveiled Amazon Nova Act, an AI model specifically trained to execute actions within web browsers. By Robert KrzaczyÅ„ski
  • Data Phoenix: Amazon's Nova Act joins OpenAI and Anthropic's computer using AI agents

Nitika Sharma@Analytics Vidhya //
China's Manus AI, developed by Monica, is generating buzz as an invite-only multi-agent AI product. This AI agent is designed to autonomously tackle complex, real-world tasks by operating as a multi-agent system. It utilizes a planner optimized for strategic reasoning, and an executor driven by Claude 3.5 Sonnet, incorporating code execution, web browsing, and multi-file code management.

The AI agent has sparked considerable global attention, igniting discussions about its technological and ethical implications, as well as its potential impact on the AI landscape. Manus reportedly outperformed OpenAI's o3-powered Deep Research agent on benchmarks, as showcased on the Manus website, leading some to believe it is among the most effective autonomous agents currently available. However, there is some skepticism due to it appearing to be a Claude wrapper with a jailbreak and tools optimized for the GAIA benchmark.

Recommended read:
References :
  • Maginative: Manus AI, China's new autonomous agent, is making waves with its ability to independently analyze, plan, and execute tasks. With industry leaders calling it “the AI agent we were promised,â€� it's raising the stakes in the global AI race.
  • MarkTechPost: In today’s digital era, the way we work is rapidly evolving, yet many challenges persist. Conventional AI assistants and manual workflows struggle to keep pace with the complexity and volume of modern tasks. Professionals and businesses face repetitive manual processes, inefficient research methods, and a lack of true automation. While traditional tools offer suggestions and […] The post appeared first on .
  • Fello AI: Manus AI is a newly announced autonomous AI agent developed by the Chinese startup Monica. It has been designed as a general AI agent that goes beyond simple text generation by autonomously planning, executing, and delivering complex tasks. The system is positioned as a breakthrough in AI technology, offering capabilities that mimic a human team working […] The post appeared first on .
  • Analytics Vidhya: Ever felt buried under a mountain of tasks, wishing for an extra set of hands to get things done? What if you could offload those tasks and get results without being glued to your screen? Manus – an AI agent from China gaining attention for its ability to handle general tasks with ease. In a […] The post appeared first on .
  • The Rundown AI: PLUS: China's Manus demos ‘world’s first fully autonomous’ AI agent
  • Craig Smith: Forbes discusses China’s Autonomous Agent, Manus, Changes Everything
  • AI News | VentureBeat: What you need to know about Manus, the new AI agentic system from China
  • AI Accelerator Institute: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
  • thezvi.wordpress.com: The Manus Marketing Madness
  • Analytics Vidhya: This article talks about comparison between China's new AI agent 'Manus' and OpenAI 'Operator'
  • The Register - Software: Prompts see it scour the web for info and turn it into decent documents at reasonable speed Chinese researchers’ AI prowess is again a hot topic after a startup called Monica.im last week revealed “Manusâ€�, a service it bills as a “general agentâ€� that might improve on tools offered by Western companies.
  • AIwire: China’s Manus AI: A Game-Changer or Just Another Overhyped Agent?
  • bdtechtalks.com: What is Manus, the AI agent taking on OpenAI Deep Research
  • OODAloop: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
  • pub.towardsai.net: Discussion on Manus AI's architecture, performance, and potential.
  • Tech News | Euronews RSS: A new Chinese AI platform is causing a frenzy. But is it worth the hype? Euronews Next takes a look.
  • techxplore.com: What to know about Manus, China's latest AI assistant
  • www.laptopmag.com: What is Manus AI? The autonomous assistant that wants to do the work for you
  • techstrong.ai: Chinese Startup’s Manus AI Agent Generates Hype, Skepticism
  • www.tomsguide.com: Manus AI is the new challenger to DeepSeek — everything you need to know
  • Gradient Flow: Manus: What You Need To Know
  • hackernoon.com: Founder of China’s New AI Model Says His Agent is More Autonomous Than Rivals'
  • iHLS: Introducing Manus: The World’s First Fully Autonomous AI Agent
  • TechNode: China’s AI agent Manus gains traction amid growing demand for autonomous AI

Thomas Claburn@The Register //
References: The Next Web , AI News , PCWorld ...
Opera has introduced "Browser Operator," a new native AI agent integrated directly into its browser. This AI agent is designed to automate repetitive tasks, enhancing user convenience by performing actions such as purchasing products, completing online forms, and gathering web content. Unlike separate tools like Google AI assistant or ChatGPT, Browser Operator is an extension of the browser itself, processing tasks locally to empower users and streamline their online activities.

Opera's AI agent utilizes natural language processing powered by Opera’s AI Composer Engine to interpret written instructions and execute corresponding tasks within the browser. It allows users to delegate tasks like buying socks, booking flights, or searching the web. Opera emphasized the privacy-focused architecture, claiming that the AI agent is faster and more secure than cloud-based alternatives because it does not take screenshots or capture videos of your screen. The tool is the latest in a long line of AI developments at the Norwegian company, which launched a fully AI-enabled browser in 2023.

Recommended read:
References :
  • The Next Web: Thenextweb reports Opera browser unveils AI agent that handles online tasks for you
  • AI News: Opera has introduced “Browser Operator,â€� a native AI agent designed to perform tasks for users directly within the browser. Rather than acting as a separate tool, Browser Operator is an extension of the browser itself—designed to empower users by automating repetitive tasks like purchasing products, completing online forms, and gathering web content. Unlike server-based AI […] The post appeared first on .
  • The Register - Software: Phantom of the Opera: AI agent now lurks within browser, for the lazy
  • PCWorld: On Monday, browser maker Opera published a seriously impressive demo of what it calls “Browser Operator,â€� showing off its upcoming AI-powered browser technology that allows you to assign shopping tasks to Opera, which it then pursues independently.
  • Towards AI: Opera Unveils AI Browser Operator & Web Automation
  • www.windowscentral.com: Microsoft's Windows Recall should've been everything Opera's Browser Operator promises to be on paper — an AI agent with a pause button" to preserve user privacy
  • www.computerworld.com: Opera adds ‘Browser Operator,’ an AI agent, to its browser
  • pub.towardsai.net: Opera Unveils AI Browser Operator & Web Automation

@the-decoder.com //
References: techcrunch.com , THE DECODER
OpenAI has expanded the availability of its AI agent, Operator, to numerous countries including Australia, Brazil, Canada, India, Japan, Singapore, South Korea, and the United Kingdom. This expansion makes Operator available in most locations where ChatGPT is accessible, with the exception of the EU, Switzerland, Norway, Liechtenstein, and Iceland, although efforts are underway to include these regions in the future. Operator, which initially launched in the U.S. in January 2025, is designed to independently operate a web browser to complete tasks for users.

Operator is currently exclusive to ChatGPT Pro subscribers, who pay $200 per month for access. The tool operates through a dedicated web page, with plans to integrate it across all ChatGPT clients in the future. As a browser-use agent, Operator faces competition from entities like Google, Anthropic, and Rabbit, each developing similar agent technologies. Early testing indicates that despite the hype around consumer tasks like ordering pizza, its future may lie in more sophisticated research and task execution, possibly in combination with tools like Deep Research.

Recommended read:
References :
  • techcrunch.com: OpenAI rolls out its AI agent, Operator, in several countries
  • THE DECODER: OpenAI rolls out Operator to more countries