News from the AI & ML world

DeeperML

@blogs.microsoft.com //
Anthropic, Google DeepMind, and OpenAI are at the forefront of developing AI agents with the ability to interact with computers in a human-like manner. These agents are designed to perform a range of tasks, including web searches, form completion, and button clicks, enabling them to order groceries, request rides, or book flights. The models employ chain-of-thought reasoning to decompose complex instructions into manageable steps, requesting user input when necessary and seeking confirmation before executing final actions.

To address safety concerns such as prompt injection attacks, developers are implementing restrictions, such as preventing the agents from logging into websites or entering payment information. Anthropic was the first to unveil this functionality in October, with its Claude chatbot now capable of "using computers the way humans do." Google DeepMind is developing Mariner, built on top of Google’s Gemini 2 language model and OpenAI launched its computer-use agent (CUA), called Operator.
Original img attribution: https://spectrum.ieee.org/media-library/a-laptop-with-a-smiley-face-on-the-screen-wearing-sunglasses-and-smirking.jpg?id=54626086&width=1200&height=600&coordinates=0%2C125%2C0%2C125
ImgSrc: spectrum.ieee.o

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • IEEE Spectrum: IEEE Spectrum discusses the development of AI agents that can use computers like humans, highlighting models from Anthropic, Google DeepMind, and OpenAI.
  • IEEE Spectrum: Article discussing OpenAI's computer-use agent, called Operator, and its ability to work with websites.
  • www.anthropic.com: Anthropic was the first to unveil this new functionality, with an announcement in October that its Claude chatbot can now “use computers the way humans do.â€�
Classification: