News from the AI & ML world

DeeperML - #ai

@blogs.nvidia.com //
Cadence has unveiled the Millennium M2000 Supercomputer, a powerhouse featuring NVIDIA Blackwell systems, aimed at revolutionizing AI-driven engineering design and scientific simulations. This supercomputer integrates NVIDIA HGX B200 systems and NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs, coupled with NVIDIA CUDA-X software libraries and Cadence's optimized software. The result is a system capable of delivering up to 80 times higher performance compared to its CPU-based predecessors, marking a significant leap forward in computational capability for electronic design automation, system design, and life sciences workloads.

This collaboration between Cadence and NVIDIA is set to enable engineers to conduct massive simulations, leading to breakthroughs in various fields, including the design and development of autonomous machines, drug molecules, semiconductors, and data centers. NVIDIA's founder and CEO, Jensen Huang, highlighted the transformative potential of AI, stating that it will infuse every aspect of business and product development. Huang also announced NVIDIA's plans to acquire ten Millennium Supercomputer systems based on the NVIDIA GB200 NVL72 platform to accelerate the company’s chip design workflows, emphasizing the importance of this technology for NVIDIA's future endeavors.

In related news, the open-source OpenSearch software has launched its 3.0 version, which includes GPU acceleration to enhance AI workloads through its new OpenSearch Vector Engine. This update leverages NVIDIA GPUs to improve search performance with large-scale vector workloads and reduce index build times, aiming to address scalability issues common in vector databases. OpenSearch 3.0 also supports Anthropic PBC’s Model Context Protocol, facilitating the integration of large language models with external data. The Millennium M2000 Supercomputer harnesses accelerated software from NVIDIA and Cadence for applications including circuit simulation, computational fluid dynamics, data center design and molecular design.

Recommended read:
References :
  • NVIDIA Newsroom: Cadence Taps NVIDIA Blackwell to Accelerate AI-Driven Engineering Design and Scientific Simulation
  • www.networkworld.com: Cadence debuts Nvidia-powered supercomputer to accelerate enterprise engineering, biotech
  • insidehpc.com: Cadence Unveils Millennium M2000 Supercomputer with NVIDIA Blackwell Systems

@siliconangle.com //
OpenSearch 3.0 has been released under the Linux Foundation, marking its first major release and positioning it as a strong competitor to ElasticSearch. This new version boasts significant improvements, including GPU acceleration, which promises to reduce costs and enhance the handling of billions of vectors for AI applications. Organizations leveraging OpenSearch for big data search, analytics, and AI are expected to experience a remarkable 9.5x performance increase due to the implementation of GPUs, specifically benefiting vector database workloads with a 9.3x boost through an experimental GPU-powered indexing mechanism.

The key highlight of OpenSearch 3.0 is the integration of GPU acceleration for vector search, which is particularly beneficial for AI workloads. This new feature utilizes Nvidia's cuVS library, enabling the power of Nvidia GPUs for creating vector indexes and powering vector searches against those indexes. The experimental GPU support aims to accelerate data-intensive workloads and index builds by up to 9.3x, while simultaneously reducing costs by 3.75x compared to CPU-only solutions. This advancement addresses scalability issues commonly encountered when dealing with billions of vectors in AI applications.

OpenSearch 3.0 also introduces Model Context Protocol (MCP) support, facilitating communication between AI agents and the platform. The minimum Java version is now Java 21, which should remove any legacy code. The new gRPC protocol allows faster data transfer. According to Carl Meadows, Governing Board Chair at the OpenSearch Software Foundation and Director of Product Management at AWS, the enterprise search market is projected to reach $8.9 billion by 2030, positioning OpenSearch 3.0 as a pivotal step in supporting the community with an open, scalable platform for the future of search and analytics.

Recommended read:
References :
  • Blocks and Files: OpenSearch 3.0 targets fast AI search with MCP and GPU-powered vectors
  • DEVCLASS: OpenSearch 3.0 hits: First major release under Linux Foundation as it battles ElasticSearch for mindshare
  • OpenSearch Project: 🚀 BIG NEWS: OpenSearch 3.0 is now available! With 9.5x performance improvement over v1.3 and GPU acceleration that cuts costs by 3.75x, it's ready to handle billions of vectors for your AI applications. Check out the major upgrades to our open-source vector database!
  • siliconangle.com: OpenSearch revs up AI workloads with GPU-accelerated vector search
  • BigDATAwire: OpenSearch Gets Parallel Performance Boost Thanks to GPUs

@aithority.com //
References: AiThority , Blocks and Files ,
Nutanix is expanding its cloud capabilities with a focus on cloud-native technologies, external storage enhancements, and generative AI integrations, unveiled at its .NEXT 2025 conference. The company is introducing Cloud Native AOS, offering general availability of Dell PowerFlex support, integrating with Pure Storage FlashArray and FlashStack, and launching Nutanix Enterprise AI initiative with NVIDIA. These updates aim to create a generalized software platform where users can run applications anywhere, addressing the growing need for flexibility and scalability in modern IT environments.

Nutanix is also deepening its integration with NVIDIA AI Enterprise to accelerate the deployment of Agentic AI applications within enterprises. The latest version of Nutanix Enterprise AI (NAI) includes NVIDIA NIM microservices and the NVIDIA NeMo framework, simplifying the building, running, and managing of AI models and inferencing services across various environments, including edge, data centers, and public clouds. This integration aims to provide a streamlined foundation for building and running secure AI agents.

The enhanced NAI solution features shared LLM endpoints, allowing customers to reuse existing deployed model endpoints for multiple applications, reducing hardware and storage costs. The platform incorporates NVIDIA's NeMo Guardrails to filter out non-approved content, ensuring compliance, privacy, and security within AI applications. Nutanix's Cloud Infrastructure solution, combined with NVIDIA's AI Data Platform, is designed to convert data into actionable insights, providing an optimized stack for GPU data processing and deployment across HCI, bare-metal, and cloud Infrastructure-as-a-Service.

Recommended read:
References :
  • AiThority: Nutanix Enables Agentic AI Anywhere with Latest Release of Nutanix Enterprise AI
  • Blocks and Files: Nutanix marries cloud-native infra with Pure Storage and agentic AI
  • Techzine Global: New version of Nutanix Enterprise AI makes agentic AI manageable

Alex Shipps@news.mit.edu //
References: LearnAI , news.mit.edu
MIT and Adobe have jointly developed CausVid, a generative AI tool capable of crafting smooth, high-quality videos in mere seconds. This hybrid AI model utilizes a diffusion model to train an autoregressive system, enabling rapid and stable high-resolution video production. Unlike existing diffusion models like OpenAI's SORA and Google's VEO 2, which process entire sequences at once and can be slow and inflexible, CausVid adopts a unique frame-by-frame approach. This allows for quick generation and on-the-fly modifications, offering a significant advantage in interactive content creation.

The CausVid tool allows users to generate clips, modify them with new prompts in real-time, transform static photos into dynamic scenes, and even extend existing videos. Imagine turning a simple text prompt into a visually stunning clip of a paper airplane morphing into a swan or woolly mammoths trekking through a snowy landscape. Users can also build upon initial prompts, adding new elements and details to their scenes interactively. This dynamic capability significantly streamlines video creation, reducing a process that once involved up to 50 steps into just a few simple actions.

According to researchers at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL), CausVid has a wide array of potential applications. It could be used in video editing to generate videos that synchronize with audio translations for live streams, helping viewers understand content in different languages. Furthermore, it could aid in rendering new content for video games or quickly producing training simulations for robots. Tianwei Yin, co-lead author of a new paper about the tool, highlights the model’s strength, attributing it to the combination of a pre-trained diffusion-based model with autoregressive architecture.

Recommended read:
References :
  • LearnAI: Hybrid AI model crafts smooth, high-quality videos in seconds | MIT News
  • news.mit.edu: The CausVid generative AI tool uses a diffusion model to teach an autoregressive (frame-by-frame) system to rapidly produce stable, high-resolution videos.

@www.techmeme.com //
References: Techmeme , Ken Yeung ,
A recent report from Amazon Web Services (AWS) indicates a significant shift in IT spending priorities for 2025. Generative AI has overtaken cybersecurity as the primary focus for global IT leaders, with 45% now prioritizing AI investments. This change underscores the increasing emphasis on implementing AI strategies and acquiring the necessary talent, even amidst ongoing skills shortages. The AWS Generative AI Adoption Index surveyed 3,739 senior IT decision makers across nine countries, including the United States, Brazil, Canada, France, Germany, India, Japan, South Korea, and the United Kingdom.

This move to prioritize generative AI doesn't suggest a neglect of security, according to Rahul Pathak, Vice President of Generative AI and AI/ML Go-to-Market at AWS. Pathak stated that customers' security remains a massive priority, and the surge in AI investment reflects the widespread recognition of AI's diverse applications and the pressing need to accelerate its adoption. The survey revealed that 90% of organizations are already deploying generative AI in some capacity, with 44% moving beyond experimental phases to production deployment, indicating a critical inflection point in AI adoption.

The survey also highlights the emergence of new leadership roles within organizations to manage AI initiatives. Sixty percent of companies have already appointed a Chief AI Officer (CAIO) or equivalent, and an additional 26% plan to do so by 2026. This executive-level commitment reflects the growing strategic importance of AI, although the study cautions that nearly a quarter of organizations may still lack formal AI transformation strategies by 2026. These companies are planning ways to bridge the gen AI talent gap this year by creating training plans to upskill their workforce for GenAI.

Recommended read:
References :
  • Techmeme: An AWS survey of 3,739 senior IT decision-makers across nine countries finds 45% plan to prioritize spending on generative AI in 2025, and 30% on cybersecurity
  • Ken Yeung: Generative AI Becomes Top IT Priority in 2025—and Is Getting a Seat in the C-Suite: AWS
  • venturebeat.com: New AWS report reveals 45% of global IT leaders now prioritize generative AI over cybersecurity in 2025 tech budgets as companies race to hire AI talent and implement AI strategies despite persistent skills shortages.

Coen van@Techzine Global //
ServiceNow has announced the launch of AI Control Tower, a centralized control center designed to manage, secure, and optimize AI agents, models, and workflows across an organization. Unveiled at Knowledge 2025 in Las Vegas, this platform provides a holistic view of the entire AI ecosystem, enabling enterprises to monitor and manage both ServiceNow and third-party AI agents from a single location. The AI Control Tower aims to address the growing complexity of managing AI deployments, giving users a central point to see all AI systems, their deployment status, and ensuring governance and understanding of their activities.

The AI Control Tower offers key benefits such as enterprise-wide AI visibility, built-in compliance and AI governance, end-to-end lifecycle management of agentic processes, real-time reporting, and improved alignment. It is designed to help AI systems administrators and other stakeholders monitor and manage every AI agent, model, or workflow within their system, providing real-time reporting for different metrics and embedded compliance and AI governance. The platform helps users understand the different systems by provider and type, improving risk and compliance management.

In addition to the AI Control Tower, ServiceNow introduced AI Agent Fabric, facilitating communication between AI agents and partner integrations. ServiceNow has also partnered with NVIDIA to engineer an open-source model, Apriel Nemotron 15B, designed to drive advancements in enterprise large language models (LLMs) and power AI agents that support various enterprise workflows. The Apriel Nemotron 15B, developed using NVIDIA NeMo and ServiceNow domain-specific data, is engineered for reasoning, drawing inferences, weighing goals, and navigating rules in real time, making it efficient and scalable for concurrent enterprise workflows.

Recommended read:
References :
  • thenewstack.io: Given that ServiceNow is, at its core, all about automating workflows for enterprises, it’s no surprise that
  • AI News | VentureBeat: ServiceNow also announced a way for agents to communicate with others along with its new observability platform.
  • Techzine Global: During Knowledge 2025 , ServiceNow launched AI Control Tower, a centralized control center for managing, securing, and optimizing AI agents, models, and workflows.
  • NVIDIA Blog: Your Service Teams Just Got a New Coworker — and It’s a 15B-Parameter Super Genius Built by ServiceNow and NVIDIA
  • www.zdnet.com: ServiceNow and Nvidia's new reasoning AI model raises the bar for enterprise AI agents
  • www.networkworld.com: ServiceNow unveiled a centralized command center the company says will enable enterprise customers to govern, manage, and secure AI agents from ServiceNow and other third-parties from a unified platform.
  • www.computerworld.com: Nvidia and ServiceNow have created an AI model that can help companies create learning AI agents to automate corporate workloads. The open-source Apriel model, available generally in the second quarter on HuggingFace, will help create AI agents that can make decisions around IT, human resources and customer-service functions.
  • blogs.nvidia.com: ServiceNow is accelerating enterprise AI with a new reasoning model built in partnership with NVIDIA — enabling AI agents that respond in real time, handle complex workflows and scale functions like IT, HR and customer service teams worldwide.
  • NVIDIA Newsroom: ServiceNow is accelerating enterprise AI with a new reasoning model built in partnership with NVIDIA — enabling AI agents that respond in real time, handle complex workflows and scale functions like IT, HR and customer service teams worldwide.
  • techstrong.ai: ServiceNow Inc. kicked off its annual artificial intelligence (AI) conference in Las Vegas Tuesday as it has in previous years -- with a fusillade of product announcements, partnerships and customer stories.
  • techstrong.ai: ServiceNow’s New AI Control Tower Commands AI Agents
  • Ken Yeung: ServiceNow Debuts AI Control Tower to Manage the Chaos of Enterprise AI Agents
  • Ken Yeung: ServiceNow and Nvidia have had a long-standing partnership building generative AI solutions for the enterprise. This week, at ServiceNow’s Knowledge customer conference, the two are introducing the latest fruits of their labor, a new large language model called Apriel Nemotron 15B with reasoning capabilities.
  • CIO Dive - Latest News: ServiceNow, Nvidia develop LLM to fuel enterprise agents
  • Ken Yeung: ServiceNow Reboots Workforce Training With Launch of ServiceNow University, Eyes 3M Learners by 2027
  • AI News: ServiceNow bets on unified AI to untangle enterprise complexity
  • www.artificialintelligence-news.com: ServiceNow bets on unified AI to untangle enterprise complexity
  • techstrong.ai: ServiceNow’s Road to AI Agents Leads to New Workflow Ecosystem, Acquisition of data.world

@venturebeat.com //
Nvidia has launched Parakeet-TDT-0.6B-V2, a fully open-source transcription AI model, on Hugging Face. This represents a new standard for Automatic Speech Recognition (ASR). The model, boasting 600 million parameters, has quickly topped the Hugging Face Open ASR Leaderboard with a word error rate of just 6.05%. This level of accuracy positions it near proprietary transcription models, such as OpenAI’s GPT-4o-transcribe and ElevenLabs Scribe, making it a significant advancement in open-source speech AI. Parakeet operates under a commercially permissive CC-BY-4.0 license.

The speed of Parakeet-TDT-0.6B-V2 is a standout feature. According to Hugging Face’s Vaibhav Srivastav, it can "transcribe 60 minutes of audio in 1 second." Nvidia reports this is achieved with a real-time factor of 3386, meaning it processes audio 3386 times faster than real-time when running on Nvidia's GPU-accelerated hardware. This speed is attributed to its transformer-based architecture, fine-tuned with high-quality transcription data and optimized for inference on NVIDIA hardware using TensorRT and FP8 quantization. The model also supports punctuation, capitalization, and detailed word-level timestamping.

Parakeet-TDT-0.6B-V2 is aimed at developers, researchers, and industry teams building various applications. This includes transcription services, voice assistants, subtitle generators, and conversational AI platforms. Its accessibility and performance make it an attractive option for commercial enterprises and indie developers looking to build speech recognition and transcription services into their applications. With its release on May 1, 2025, Parakeet is set to make a considerable impact on the field of speech AI.

Recommended read:
References :
  • Techmeme: Nvidia launches open-source transcription model Parakeet-TDT-0.6B-V2, topping the Hugging Face Open ASR Leaderboard with a word error rate of 6.05% (Carl Franzen/VentureBeat)
  • @techmeme.com - Techmeme: Nvidia launches open-source transcription model Parakeet-TDT-0.6B-V2, topping the Hugging Face Open ASR Leaderboard with a word error rate of 6.05% (Carl Franzen/VentureBeat)
  • venturebeat.com: An attractive proposition for commercial enterprises and indie developers looking to build speech recognition and transcription services...
  • www.marktechpost.com: NVIDIA Open Sources Parakeet TDT 0.6B: Achieving a New Standard for Automatic Speech Recognition ASR and Transcribes an Hour of Audio in One Second
  • AI News | VentureBeat: Reports Nvidia launches fully open source transcription AI model Parakeet-TDT-0.6B-V2 on Hugging Face
  • MarkTechPost: Reports NVIDIA Open Sources Parakeet TDT 0.6B: Achieving a New Standard for Automatic Speech Recognition ASR and Transcribes an Hour of Audio in One Second
  • www.eweek.com: NVIDIA’s AI Transcription Tool Produces 60 Minutes of Text in 1 Second
  • eWEEK: NVIDIA has released a new version of its Parakeet transcription tool, boasting the lowest error rate of any of its competitors. In addition, the company made the code public on GitHub. Parakeet TDT 0.6B is a 600-million-parameter automatic speech recognition model. It can transcribe 60 minutes of audio per second, Hugging Face data scientist Vaibhav […]

erichs211@gmail.com (Eric@techradar.com //
Google's powerful AI model, Gemini 2.5 Pro, has achieved a significant milestone by completing the classic Game Boy game Pokémon Blue. This accomplishment, spearheaded by software engineer Joel Z, demonstrates the AI's enhanced reasoning and problem-solving abilities. Google CEO Sundar Pichai celebrated the achievement online, highlighting it as a substantial win for AI development. The project showcases how AI can learn to handle complex tasks, requiring long-term planning, goal tracking, and visual navigation, which are vital components in the pursuit of general artificial intelligence.

Joel Z facilitated Gemini's gameplay over several months, livestreaming the AI's progress. While Joel is not affiliated with Google, his efforts were supported by the company's leadership. To enable Gemini to navigate the game, Joel used an emulator, mGBA, to feed screenshots and game data, like character position and map layout. He also incorporated smaller AI helpers, like a "Pathfinder" and a "Boulder Puzzle Solver," to tackle particularly challenging segments. These sub-agents, also versions of Gemini, were deployed strategically by the AI to manage complex situations, showcasing its ability to differentiate between routine and complicated tasks.

Google is also experimenting with transforming its search engine into a Gemini-powered chatbot via an AI Mode. This new feature, currently being tested with a small percentage of U.S. users, delivers conversational answers generated from Google's vast index, effectively turning Search into an answer engine. Instead of a list of links, AI Mode provides rich, visual summaries and remembers prior queries, directly competing with the search features of Perplexity and ChatGPT. While this shift could potentially impact organic SEO tactics, it signifies Google's commitment to integrating AI more deeply into its core products, offering users a more intuitive and informative search experience.

Recommended read:
References :
  • the-decoder.com: Google's reasoning LLM Gemini 2.5 Pro beats Pokémon Blue with a little help
  • thetechbasic.com: Google’s powerful AI model, Gemini 2.5 Pro, has finished playing the old Game Boy game Pokémon Blue.
  • www.techradar.com: Google's Gemini AI Is now a Pokémon Master
  • THE DECODER: Google's reasoning LLM Gemini 2.5 Pro beats Pokémon Blue with a little help
  • The Tech Basic: Google Gemini AI Beats Pokémon Blue With Help and Updates

@the-decoder.com //
OpenAI is making significant strides in the enterprise AI and coding tool landscape. The company recently released a strategic guide, "AI in the Enterprise," offering practical strategies for organizations implementing AI at a large scale. This guide emphasizes real-world implementation rather than abstract theories, drawing from collaborations with major companies like Morgan Stanley and Klarna. It focuses on systematic evaluation, infrastructure readiness, and domain-specific integration, highlighting the importance of embedding AI directly into user-facing experiences, as demonstrated by Indeed's use of GPT-4o to personalize job matching.

Simultaneously, OpenAI is reportedly in the process of acquiring Windsurf, an AI-powered developer platform, for approximately $3 billion. This acquisition aims to enhance OpenAI's AI coding capabilities and address increasing competition in the market for AI-driven coding assistants. Windsurf, previously known as Codeium, develops a tool that generates source code from natural language prompts and is used by over 800,000 developers. The deal, if finalized, would be OpenAI's largest acquisition to date, signaling a major move to compete with Microsoft's GitHub Copilot and Anthropic's Claude Code.

Sam Altman, CEO of OpenAI, has also reaffirmed the company's commitment to its non-profit roots, transitioning the profit-seeking side of the business to a Public Benefit Corporation (PBC). This ensures that while OpenAI pursues commercial goals, it does so under the oversight of its original non-profit structure. Altman emphasized the importance of putting powerful tools in the hands of everyone and allowing users a great deal of freedom in how they use these tools, even if differing moral frameworks exist. This decision aims to build a "brain for the world" that is accessible and beneficial for a wide range of uses.

Recommended read:
References :
  • The Register - Software: OpenAI's contentious plan to overhaul its corporate structure in favor of a conventional for-profit model has been reworked, with the AI giant bowing to pressure to keep its nonprofit in control, even as it presses ahead with parts of the restructuring.
  • the-decoder.com: OpenAI restructures as public benefit corporation under non-profit control
  • www.theguardian.com: OpenAI reverses plans to spin off its for-profit arm, maintaining control under its non-profit entity.
  • techxplore.com: OpenAI reverses course and says its nonprofit will continue to control its business
  • www.techradar.com: OpenAI will transition to running under the oversight of a non-profit, and its profit side is to become a Public Benefit Corporation.
  • Maginative: The company will transition its for-profit arm into a Public Benefit Corporation.
  • THE DECODER: OpenAI will remain under the control of its non-profit entity instead of forming a separate for-profit company as previously planned, according to reports.
  • Mashable: The nonprofit status of OpenAI is one of the biggest controversies in Silicon Valley. On Monday, May 5, CEO Sam Altman said the company structure is "evolving."
  • SiliconANGLE: OpenAI is reversing course on a plan to spin off its for-profit arm, stating that its nonprofit will remain in charge.
  • Artificial Lawyer: OpenAI is restructuring its for-profit LLC into a Public Benefit Corporation (PBC) while maintaining the non-profit's control.
  • www.itpro.com: OpenAI's board decided to keep its non-profit in control of the company's operations, backing away from its previous plan for a for-profit spinoff
  • The Rundown AI: OpenAI ends for-profit push
  • shellypalmer.com: OpenAI Supercharges ChatGPT Search with Shopping Tools
  • Effective Altruism Forum: Evolving OpenAI’s Structure
  • WIRED: The startup behind ChatGPT is going to remain in nonprofit control, but it still needs regulatory approval.
  • the-decoder.com: The Decoder reports on OpenAI's potential $3 billion acquisition of Windsurf.
  • www.marktechpost.com: OpenAI Releases a Strategic Guide for Enterprise AI Adoption: Practical Lessons from the Field
  • THE DECODER: The Decoder's report on OpenAI's Windsurf deal boosting coding AI.
  • AI News | VentureBeat: Report: OpenAI is buying AI-powered developer platform Windsurf — what happens to its support for rival LLMs?
  • John Werner: OpenAI Strikes $3 Billion Deal To Buy Windsurf: Reports
  • Latest from ITPro in News: OpenAI is closing in on its biggest acquisition to date – and it could be a game changer for software developers and ‘vibe coding’ fanatics
  • www.artificialintelligence-news.com: Sam Altman: OpenAI to keep nonprofit soul in restructuring
  • AI News: OpenAI CEO Sam Altman has laid out their roadmap, and the headline is that OpenAI will keep its nonprofit core amid broader restructuring.
  • Analytics India Magazine: OpenAI to Acquire Windsurf for $3 Billion to Dominate AI Coding Space
  • THE DECODER: Elon Musk’s lawyer says OpenAI restructuring is a transparent dodge
  • futurism.com: OpenAI may be raking in the investor dough, but thanks in part to erstwhile cofounder Elon Musk, the company won't be going entirely for-profit anytime soon.
  • thezvi.wordpress.com: Your voice has been heard. OpenAI has ‘heard from the Attorney Generals’ of Delaware and California, and as a result the OpenAI nonprofit will retain control of OpenAI under their new plan, and both companies will retain the original mission. …
  • www.computerworld.com: OpenAI reaffirms nonprofit control, scales back governance changes
  • thezvi.wordpress.com: OpenAI Claims Nonprofit Will Retain Nominal Control

@www.medianama.com //
Visa, Mastercard, and PayPal have recently unveiled agent-ready platforms, marking a significant shift in the landscape of online commerce. These platforms enable AI systems to autonomously shop, make decisions, and handle payments on behalf of users. Visa's Intelligent Commerce platform allows AI agents to securely make purchases using credit cards while adhering to consumer-defined spending limits. Similarly, Mastercard's Agent Pay facilitates the integration of tokenized card credentials directly into AI-driven workflows. PayPal has launched an Agent Toolkit, allowing developers to embed payment, invoicing, and shipping functionalities into AI assistants.

This development signals a move towards autonomous shopping and agentic commerce, potentially revolutionizing online checkout processes for both consumers and businesses. With Visa's Intelligent Commerce, AI agents can now go beyond simply recommending products to actually completing transactions. The system replaces traditional card details with secure, tokenized digital credentials accessible to authorized AI agents. Users maintain control by setting parameters such as spending limits and preferred merchant categories. The collaboration between Visa, Mastercard, and PayPal aims to establish secure frameworks for AI agents to conduct financial transactions.

These platforms offer numerous potential benefits. For consumers, this could mean more efficient and personalized shopping experiences. For example, an AI assistant could be instructed to book a flight within a specific budget or order weekly groceries, completing the purchase without manual input. Businesses stand to gain from new opportunities for customer interaction and increased sales through hyper-personalized offers and real-time transaction data. As these platforms mature, retailers may need to adapt by creating "agent-readable" catalogs and promotions, as brand equity shifts towards the agents consumers choose to use.

Recommended read:
References :
  • shellypalmer.com: Mastercard, Visa, and PayPal just rewrote online checkout. Within 24 hours, all three networks launched agent-ready platforms that let autonomous AI systems shop, decide, and pay for us.
  • futurism.com: Visa — yes, that Visa — is wading into the world of AI agents. On Wednesday, the credit card monolith announced it would be teaming up with some of the AI industry's leading developers to connect its vast payments network to their AI systems.  The end game? Letting an autonomous AI model — an agent — control your credit card and make purchases ranging from groceries to clothing on your behalf, based on your budget and preferences. "We think this could be really important," Jack Forestell, Visa's chief product and strategy officer, told the Associated Press. "Transformational, on the order […]
  • www.medianama.com: Visa and Mastercard Let AI Agents Spend for You: But What’s the Risk?
  • Tor Constantino: Mastercard And Visa Unleash AI Agents To Shop For You
  • venturebeat.com: Visa launches Intelligent Commerce platform enabling AI assistants to make secure purchases with your credit card, transforming online shopping with personalized automation and consumer-controlled spending limits.
  • TechSpot: By bridging the gap between AI's growing capabilities and secure payment processing, Visa is positioning itself to play a pivotal role in the next evolution of commerce.
  • www.zdnet.com: Imagine AI agents finding and ordering products for you. With this latest Visa announcement, that future just got a little closer.
  • Shelly Palmer: Mastercard, Visa, and PayPal just rewrote online checkout. Within 24 hours, all three networks launched agent-ready platforms that let autonomous AI systems shop, decide, and pay for us.
  • Shelly Palmer: Mastercard, Visa, and PayPal just rewrote online checkout. Within 24 hours, all three networks launched agent-ready platforms that let autonomous AI systems shop, decide, and pay for us.
  • techxplore.com: Artificial intelligence "agents" are supposed to be more than chatbots. The tech industry has spent months pitching AI personal assistants that know what you want and can do real work on your behalf.

@the-decoder.com //
Google is integrating its Gemini AI model deeper into its search engine with the introduction of 'AI Mode'. This new feature, currently in a limited testing phase in the US, aims to transform the search experience into a conversational one. Instead of the traditional list of links, AI Mode delivers answers generated directly from Google’s index, functioning much like a Gemini-powered chatbot. The search giant is also dropping the Labs waitlist, allowing any U.S. user who opts in to try the new search function.

The AI Mode includes visual place and product cards, enhanced multimedia features, and a left-side panel for managing past searches. This provides more organized results for destinations, products, and services. Users can ask contextual follow-up questions, and the AI Mode will populate a sidebar with cards referring to the sources it's using to formulate its answers. It can also access Google's Shopping Graph and localized data from Maps.

This move is seen as Google's direct response to AI-native upstarts that are recasting the search bar as a natural-language front end to the internet. Google CEO Sundar Pichai is hopeful to have an agreement with Apple to have Gemini as an option as part of Apple Intelligence by middle of this year. The rise of AI in search raises concerns for marketers. Organic SEO tactics built on blue links will erode and there will be a need to prepare content for zero‑click, AI‑generated summaries.

Recommended read:
References :
  • shellypalmer.com: Google’s AI Mode: The Chatbot Comes to Search
  • Android Faithful: A Simple Google Search Is Now a Thing of the Past
  • The Tech Portal: Google is now reportedly preparing to expand access to its Gemini AI chatbot, including Gemini for children under 13, in its search engine.
  • www.computerworld.com: Google is making changes to its venerable search interface so users can more naturally interact with its AI features.
  • www.socialmediatoday.com: Google's giving more people access to its new "AI Mode" in Search.
  • Shelly Palmer: Google's AI Mode: The Chatbot Comes to Search

@the-decoder.com //
Google is enhancing its AI capabilities across several platforms. NotebookLM, the AI-powered research tool, is expanding its "Audio Overviews" feature to approximately 75 languages, including less common ones such as Icelandic, Basque, and Latin. This enhancement will enable users worldwide to listen to AI-generated summaries of documents, web pages, and YouTube transcripts, making research more accessible. The audio for each language is generated by AI agents using metaprompting, with the Gemini 2.5 Pro language model as the underlying system, moving towards audio production technology based entirely on Gemini’s multimodality.

These Audio Overviews are designed to distill a mix of documents into a scripted conversation between two synthetic hosts. Users can direct the tone and depth through prompts, and then download an MP3 or keep playback within the notebook. This expansion rebuilds the speech stack and language detection while maintaining a one-click flow. Early testers have reported that multilingual voices make long reading lists easier to digest and provide an alternative channel for blind or low-vision audiences.

In addition to NotebookLM enhancements, Google Gemini is receiving AI-assisted image editing capabilities. Users will be able to modify backgrounds, swap objects, and make other adjustments to both AI-generated and personal photos directly within the chat interface. These editing tools are being introduced gradually for users on web and mobile devices, supporting over 45 languages in most countries. To access the new features on your phone, users will need the latest version of the Gemini app.

Recommended read:
References :
  • www.techradar.com: Google reveals powerful NotebookLM app for Android and iOS with release date – here's what it looks like
  • TestingCatalog: Google expands NotebookLM with Audio Overviews in over 50 languages
  • THE DECODER: Google Gemini brings AI-assisted image editing to chat
  • the-decoder.com: Google Gemini brings AI-assisted image editing to chat
  • www.tomsguide.com: Google Gemini adds new image-editing tools — here's what they can do
  • The Tech Basic: Google Brings NotebookLM AI Research Assistant to Mobile With Offline Podcasts and Enhanced Tools
  • PCMag Middle East ai: Google CEO: Gemini Could Be Integrated Into Apple Intelligence This Year
  • gHacks Technology News: Google is rolling out an update for its Gemini app that adds a quality-of-life feature. Users can now access the AI assistant directly from their home screens, bypassing the need to navigate
  • PCMag Middle East ai: Research in Your Pocket: Google's Powerful NotebookLM AI Tool Coming to iOS, Android