News from the AI & ML world

DeeperML - #openaiagents

Jesus Rodriguez@TheSequence //
OpenAI has recently launched new audio features and tools aimed at enhancing the capabilities of AI agents. The releases include updated transcription and text-to-speech models, as well as tools for building AI agents. The transcription models, gpt-4o-transcribe and gpt-4o-mini-transcribe, promise better performance than the previous Whisper models, achieving lower word error rates across multiple languages and showing improvements in challenging audio conditions such as varying accents and background noise. Because these models are built on top of language models, instructions and data share the same token stream, which makes them potentially vulnerable to prompt injection attacks.

OpenAI also unveiled new tools for AI agent development, featuring a Responses API, built-in web search, file search, and computer use functionalities, alongside an open-source Agents SDK. Furthermore, it introduced o1 Pro, a new reasoning model positioned for complex reasoning tasks, which comes at a high cost: $150 per million input tokens and $600 per million output tokens. The gpt-4o-mini-tts text-to-speech model introduces "steerability", allowing developers to control the tone and delivery of the generated speech.
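The steerability described above can be sketched as a request to the speech endpoint. This is a minimal sketch, assuming the official `openai` Python package, an `OPENAI_API_KEY` in the environment, and that tone guidance is passed through an `instructions` parameter. The voice name, the sample text, and the helper names `build_speech_request` and `synthesize` are illustrative assumptions, not details taken from the coverage.

```python
"""Sketch: steering the tone of gpt-4o-mini-tts (assumptions noted inline)."""


def build_speech_request(text: str, tone: str) -> dict:
    """Return keyword arguments for a text-to-speech request.

    `tone` is free-form guidance such as "calm and reassuring".
    """
    return {
        "model": "gpt-4o-mini-tts",  # model named in the article
        "voice": "coral",            # assumption: any supported voice name
        "input": text,               # the words to be spoken
        "instructions": f"Speak in a {tone} tone.",  # assumed steering field
    }


def synthesize(text: str, tone: str, out_path: str = "speech.mp3") -> str:
    """Send the request and save the audio; needs network access and an API key."""
    from openai import OpenAI  # imported lazily so the builder works offline

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.audio.speech.create(**build_speech_request(text, tone))
    with open(out_path, "wb") as fh:
        fh.write(response.read())  # assumed: binary response exposes read()
    return out_path


if __name__ == "__main__":
    # Offline demonstration: only prints the request that would be sent.
    print(build_speech_request("Your order has shipped.", "cheerful"))
```

Keeping the request builder separate from the network call makes the steering text easy to inspect and vary without spending API credits.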

References:
  • Data Phoenix: OpenAI Launches New Tools for Building AI Agents
  • Fello AI: OpenAI's new o1 Pro pricing strategy with a substantial markup compared to previous models.
  • TheSequence: The Sequence Engineering #513: A Deep Dive Into OpenAI's New Tools for Developing AI Agents
  • AI News | VentureBeat: OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds
  • Windows Copilot News: Canadian Media Outlets Sue OpenAI Over Copyright Infringement
  • www.techrepublic.com: Have Some Spare Cash? You’ll Need it for OpenAI’s New API
  • bsky.app: Discussion of OpenAI's new o1-Pro API pricing and its implications for the AI community.
  • Maginative: OpenAI Unveils New Audio Models to Make AI Agents Sound More Human Than Ever
  • bsky.app: This blog post discusses OpenAI's new audio models, noting their promising features but also mentioning the issue of mixing instructions and data in the same token stream.
  • www.techrepublic.com: This article reports on OpenAI's new text-to-speech and speech-to-text tools based on GPT-4o, highlighting their capabilities and potential applications but also mentioning a possible similar path for video.
  • Analytics Vidhya: OpenAI's Audio Models: How to Access, Features, Applications, and More
  • MarkTechPost: OpenAI Introduced Advanced Audio Models ‘gpt-4o-mini-tts’, ‘gpt-4o-transcribe’, and ‘gpt-4o-mini-transcribe’: Enhancing Real-Time Speech Synthesis and Transcription Capabilities for Developers
  • Simon Willison's Weblog: OpenAI announced today, for both text-to-speech and speech-to-text. They're very promising new models, but they appear to suffer from the ever-present risk of accidental (or malicious) instruction following.
  • THE DECODER: OpenAI releases new AI voice models with customizable speaking styles
  • Composio: Finally, OpenAI gave in and launched a new agentic framework called Agents SDK.
  • Last Week in AI: Our 204th episode, with a summary and discussion of last week's big AI news. Recorded on 03/21/2025. In this episode: Baidu launched two new multimodal models, Ernie 4.5 and Ernie X1, boasting competitive pricing and capabilities compared to Western counterparts like GPT-4.5 and DeepSeek R1. OpenAI introduced new audio models, including impressive speech-to-text and text-to-speech systems, and added o1 Pro to its developer API at high cost, reflecting efforts toward greater profitability. Nvidia and Apple announced significant hardware advancements, including Nvidia's future GPU plans and Apple's new Mac Studio, which can run DeepSeek R1. DeepSeek employees are facing travel restrictions, suggesting China is treating its AI development with increased secrecy and urgency and putting AI competition on a wartime footing. Discord: https://discord.gg/nTyezGSKwP

Ellie Ramirez-Camara@Data Phoenix //
References: Data Phoenix, Fello AI, TheSequence ...
OpenAI has recently unveiled a suite of new tools aimed at simplifying the development of AI agents. This release includes the Responses API, designed as a flexible foundation for building agents, along with built-in capabilities for web search, file search, and computer use. An open-source Agents SDK is also part of the package, intended to help developers orchestrate both single-agent and multi-agent workflows, providing the essential building blocks for creating reliable AI agents.

OpenAI has also introduced o1 Pro, its latest AI reasoning model, but it comes with a steep price tag. At $150 per million input tokens and $600 per million output tokens, o1 Pro is significantly more expensive than previous models: ten times the price of the standard o1 model and twice the input price of GPT-4.5. OpenAI claims that o1 Pro "thinks harder" and provides "consistently better" responses, especially for complex tasks, but early benchmarks suggest the improvements may be incremental.
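To make the per-million-token pricing concrete, the arithmetic can be sketched as below. The o1 Pro prices are the ones quoted above. The o1 prices are an assumption derived from the "ten times" comparison, and the token counts and the helper name `request_cost` are purely illustrative.

```python
"""Sketch: per-request cost at the per-million-token prices quoted above."""

# USD per one million tokens: (input, output).
PRICES = {
    "o1-pro": (150, 600),  # figures quoted in the article
    "o1": (15, 60),        # assumption: one tenth of o1 Pro, per the "ten times" claim
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of one request for the given token counts."""
    input_price, output_price = PRICES[model]
    # tokens * (USD per million tokens) gives millionths of a dollar
    total_micro_dollars = input_tokens * input_price + output_tokens * output_price
    return total_micro_dollars / 1_000_000


if __name__ == "__main__":
    for model in PRICES:
        cost = request_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model}: ${cost:.2f}")
```

For a hypothetical request with 10,000 input tokens and 2,000 output tokens, this works out to $2.70 on o1 Pro against $0.27 on o1 under the assumed prices.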

References:
  • Data Phoenix: OpenAI has launched a comprehensive suite of new tools including the Responses API, built-in capabilities for web search, file search, and computer use, and an open-source Agents SDK—all designed to make it significantly easier for developers to build AI agents.
  • Fello AI: Move over, GPT-4.5, there’s a new overpriced AI model in town. OpenAI has just launched o1 Pro, its latest AI reasoning model, and it comes with a price tag so absurd it makes previous models look like dollar-store knockoffs.
  • Shelly Palmer: Last week, Google and OpenAI asked the White House for permission to train AI on copyrighted content, arguing that restrictive laws will cripple U.S. innovation while China advances unchecked.
  • TheSequence: Responses API, file and web search and multi agent coordination are some of the key capabilities of the new stack.
  • bsky.app: Bsky post about OpenAI's new text-to-speech and speech-to-text models

Matt Marshall@AI News | VentureBeat //
OpenAI has unveiled its open-source Agents SDK, along with the Responses API and a set of built-in tools. These tools simplify the development of AI agents for enterprise use by consolidating a complex ecosystem into a unified framework. The platform allows developers to create AI agents capable of performing tasks autonomously. The Responses API combines the simplicity of the Chat Completions API with the tool-use capabilities of the Assistants API, while the Agents SDK helps users orchestrate both single-agent and multi-agent workflows.

The initiative targets AI agent reliability and marks a turning point: OpenAI is recognizing that external developers can contribute innovative solutions to the advancement of agent technology. The SDK reduces the complexity of agent development, so that projects which previously required multiple frameworks and specialized databases can be built on a single, standardized platform. With web search, file search, and computer use integrated, the Responses API lets agents work with real-world data and internal, proprietary business contexts more effectively.
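A Responses API call with built-in web search might look roughly like the sketch below. It assumes the official `openai` Python package, that the tool is enabled with a `web_search_preview` entry in `tools`, and that the reply text is available as `output_text`. The model name and the helper names are illustrative assumptions.

```python
"""Sketch: one Responses API call with the built-in web search tool enabled."""


def build_web_search_request(question: str) -> dict:
    """Return keyword arguments for a Responses API call that may search the web."""
    return {
        "model": "gpt-4o",                          # assumption: any supported model
        "tools": [{"type": "web_search_preview"}],  # assumed launch-time tool name
        "input": question,
    }


def ask(client, question: str) -> str:
    """Send the request through an OpenAI-style client and return the answer text."""
    response = client.responses.create(**build_web_search_request(question))
    return response.output_text  # assumed convenience property with the final text
```

With the package installed and an API key configured, `ask(OpenAI(), "...")` would send the request. Passing the client in as a parameter keeps the function easy to exercise with a stub.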

References:
  • Gradient Flow: Deep Dive into OpenAI’s Agent Ecosystem
  • techstrong.ai: OpenAI Introduces Developer Tools to Build AI Agents
  • venturebeat.com: OpenAI’s strategic gambit: The Agents SDK and why it changes everything for enterprise AI
  • www.itpro.com: OpenAI wants to simplify how developers build AI agents
  • Latent.Space: Nikunj Handa and Romain Huet from OpenAI join us to preview their new Agents APIs: Responses, Web Search, and Computer Use, as well as a new agents SDK.
  • Analytics Vidhya: How to Use OpenAI’s Responses API & Agent SDK?
  • Analytics Vidhya: Guardrails in OpenAI Agent SDK: Ensuring Integrity in Educational Support Systems
  • Windows Copilot News: Microsoft unleashes autonomous Copilot AI agents in public preview
  • www.infoq.com: OpenAI Launches New API, SDK, and Tools to Develop Custom Agents
  • Gradient Flow: AI This Week: New Agents, Open Models, and the Race for Productivity
  • Shelly Palmer: Details how OpenAI's new Responses API makes it dramatically easier to create AI agents.
  • Data Phoenix: OpenAI Launches New Tools for Building AI Agents
  • Windows Copilot News: This article discusses the potential for OpenAI's Responses API to revolutionize AI agent development, emphasizing its ability to enable real-time web search, file search, and computer interactions, making AI agents more powerful and versatile.
  • TheSequence: The Sequence Engineering #513: A Deep Dive Into OpenAI's New Tools for Developing AI Agents
  • neptune.ai: How to Build an LLM Agent With AutoGen: Step-by-Step Guide
  • Developer Tech News: OpenAI has launched a comprehensive suite of new tools including the Responses API, built-in capabilities for web search, file search, and computer use, and an open-source Agents SDK—all designed to make it significantly easier for developers to build AI agents.

Matt Marshall@AI News | VentureBeat //
OpenAI has unveiled a new suite of APIs and tools aimed at simplifying the development of AI agents for enterprises. The firm is releasing building blocks designed to assist developers and businesses in creating practical and dependable agents, defined as systems capable of independently accomplishing tasks. These tools are designed to address challenges faced by software developers in building production-ready applications, with the goal of automating and streamlining operations.

The newly launched platform includes the Responses API, which is a superset of the Chat Completions API, along with built-in tools, the OpenAI Agents SDK, and enhanced observability features. Nikunj Handa and Romain Huet from OpenAI previewed the new agent APIs (Responses, Web Search, and Computer Use) and introduced the new Agents SDK. The Responses API is positioned as a more flexible foundation for developers working with OpenAI models, offering built-in tools such as web search, computer use, and file search.
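The idea of orchestrating single-agent and multi-agent workflows can be illustrated with a toy in plain Python. This sketch does not reproduce the Agents SDK's interface. The `Agent` dataclass, the `run` loop, and the triage, billing, and support agents are all hypothetical, and the "agents" are ordinary functions rather than model calls.

```python
"""Toy sketch of single- and multi-agent orchestration (not the Agents SDK API)."""
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Agent:
    """A named unit of work that may hand a task off to another agent."""
    name: str
    handle: Callable[[str], str]                  # produces the final answer
    route: Optional[Callable[[str], str]] = None  # returns the next agent's name


def run(agents: Dict[str, Agent], start: str, task: str, max_handoffs: int = 3) -> List[str]:
    """Follow handoffs from `start`; return visited agent names plus the answer."""
    trace: List[str] = []
    current = agents[start]
    for _ in range(max_handoffs + 1):
        trace.append(current.name)
        if current.route is None:
            # No further handoff: this agent answers the task itself.
            trace.append(current.handle(task))
            return trace
        current = agents[current.route(task)]
    raise RuntimeError("too many handoffs")


def triage_route(task: str) -> str:
    """Send invoice questions to billing and everything else to support."""
    return "billing" if "invoice" in task.lower() else "support"


AGENTS = {
    "triage": Agent("triage", handle=lambda t: "", route=triage_route),
    "billing": Agent("billing", handle=lambda t: f"billing handled: {t}"),
    "support": Agent("support", handle=lambda t: f"support handled: {t}"),
}
```

Starting at `"triage"` exercises the multi-agent path with one handoff, while starting directly at `"support"` is the single-agent case. The `max_handoffs` cap is a simple guard against routing loops.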

References:
  • Analytics Vidhya: New Tools for Building AI Agents: OpenAI Agent SDK, Response API and More
  • Maginative: OpenAI Launches Responses API and Agents SDK for AI Agents
  • TestingCatalog: OpenAI released new tools and APIs for AI agent development
  • AI News | VentureBeat: OpenAI unveils Responses API, open source Agents SDK, letting developers build their own Deep Research and Operator
  • The Tech Portal: OpenAI releases new APIs and tools for businesses to create AI agents
  • Developer Tech News: OpenAI launches tools to build AI agents faster
  • www.infoworld.com: OpenAI takes on rivals with new Responses API, Agents SDK
  • techstrong.ai: OpenAI Introduces Developer Tools to Build AI Agents
  • www.zdnet.com: Why OpenAI's new AI agent tools could change how you code
  • www.itpro.com: OpenAI wants to simplify how developers build AI agents
  • Latent.Space: Nikunj Handa and Romain Huet from OpenAI join us to preview their new Agents APIs: Responses, Web Search, and Computer Use, as well as a new agents SDK.
  • Analytics Vidhya: Guardrails in OpenAI Agent SDK: Ensuring Integrity in Educational Support Systems
  • Gradient Flow: Deep Dive into OpenAI’s Agent Ecosystem
  • venturebeat.com: OpenAI’s strategic gambit: The Agents SDK and why it changes everything for enterprise AI
  • pub.towardsai.net: This article focuses on the development of AI agents and the role of OpenAI in simplifying the process. It emphasizes the importance of OpenAI's new Agent SDK and its potential to transform how developers create systems that can autonomously handle complex, multi-step tasks.
  • Windows Report: This article highlights OpenAI's new AI Agents and its promise to revolutionize AI development. It discusses the company's release of a comprehensive suite of tools and APIs designed to simplify the development of AI agents, capable of autonomously handling complex, multi-step tasks.
  • Windows Copilot News: OpenAI has unveiled new tools and APIs designed to streamline the creation of AI agents for enterprises. These tools are aimed at transforming how developers construct AI systems capable of autonomously handling intricate, multi-step tasks.
  • www.infoq.com: OpenAI Launches New API, SDK, and Tools to Develop Custom Agents
  • Gradient Flow: AI This Week: New Agents, Open Models, and the Race for Productivity
  • Upward Dynamism: AI Agents 101 – The Next Big Thing in AI You Shouldn’t Ignore
  • Shelly Palmer: AI Agents Are Coming—and OpenAI Just Made Them Easier to Deploy
  • Unite.AI: Developer Barriers Lowered as OpenAI Simplifies AI Agent Creation