OpenAI Advances AI Agents and Reasoning Capabilities

Megan Crouse@techrepublic.com //

OpenAI has released new audio models alongside o1 Pro, a reasoning model priced well above the rest of its lineup. The audio models, including gpt-4o-transcribe and gpt-4o-mini-transcribe, transcribe more accurately than Whisper. Because they are built on language models, however, they are susceptible to prompt injection: instructions contained in the audio may be acted on rather than simply transcribed. Developers can reach the models through the Realtime API, which supports real-time transcription of microphone input from a standalone Python script.
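As a rough illustration of calling one of these models, the sketch below transcribes a single audio file with the OpenAI Python SDK. It deliberately uses the simpler file-based transcription endpoint rather than the Realtime API microphone flow described above. The function name `transcribe_file` and the model allow-list are hypothetical helpers introduced here, and the sketch assumes the `openai` package is installed and `OPENAI_API_KEY` is set.

```python
from pathlib import Path

# Model names come from the article; the allow-list itself is an illustrative choice.
TRANSCRIBE_MODELS = {"gpt-4o-transcribe", "gpt-4o-mini-transcribe"}


def transcribe_file(path: str, model: str = "gpt-4o-mini-transcribe") -> str:
    """Send one audio file to the transcription endpoint and return its text."""
    if model not in TRANSCRIBE_MODELS:
        raise ValueError(f"unexpected model: {model!r}")

    audio_path = Path(path)
    if not audio_path.is_file():
        raise FileNotFoundError(audio_path)

    # Imported lazily so this module still loads when the SDK is not installed.
    from openai import OpenAI

    client = OpenAI()  # assumption: the API key is read from OPENAI_API_KEY
    with audio_path.open("rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio)

    # Given the prompt-injection risk noted above, callers should treat this
    # text as untrusted input rather than as instructions.
    return result.text
```

A call such as `transcribe_file("meeting.wav")` would return the transcript as a plain string; the file name is only an example.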

OpenAI's o1 Pro costs $150 per million input tokens and $600 per million output tokens. That is ten times the price of the standard o1 model and twice the input price of GPT-4.5. OpenAI claims o1 Pro "thinks harder" and delivers better responses on complex reasoning tasks, but early benchmarks suggest only incremental improvements. Access is currently limited to developers who have spent at least $5 on OpenAI's API services, and the model is aimed at those building AI agents and automation tools.
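To make the per-token prices concrete, here is a minimal cost sketch. The o1 Pro rates are the ones quoted above; the standard o1 rates are not stated directly and are derived here from the tenfold comparison. The `Pricing` class, the `request_cost` function, and the token counts are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Price in US dollars per one million tokens."""
    input_per_million: float
    output_per_million: float


# Rates quoted in the article.
O1_PRO = Pricing(input_per_million=150.0, output_per_million=600.0)

# Assumption: derived from the "ten times more expensive" comparison.
STANDARD_O1 = Pricing(
    input_per_million=O1_PRO.input_per_million / 10,
    output_per_million=O1_PRO.output_per_million / 10,
)


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request at the given rates."""
    total = (
        input_tokens * pricing.input_per_million
        + output_tokens * pricing.output_per_million
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 10,000 input tokens and 2,000 output tokens.
    for name, pricing in (("o1 Pro", O1_PRO), ("o1", STANDARD_O1)):
        print(f"{name}: ${request_cost(pricing, 10_000, 2_000):.2f}")
```

For that illustrative request, the o1 Pro cost works out to $2.70 (1.50 for input plus 1.20 for output), against $0.27 at the derived o1 rates.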

Original img attribution: https://assets.techrepublic.com/uploads/2025/03/tr_20250321-news-openai-agentic-voice-models-text-to-speech-to-text.jpg

References:

Fello AI: OpenAI Just Dropped Its Most Expensive AI Model Yet, And It Costs a Fortune
www.techrepublic.com: OpenAI Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails
AI News | VentureBeat: Describes OpenAI’s new voice AI model gpt-4o-transcribe and its ability to add speech to existing text apps.
MarkTechPost: Explains the release of advanced audio models gpt-4o-mini-tts, gpt-4o-transcribe, and gpt-4o-mini-transcribe by OpenAI.
THE DECODER: OpenAI releases new AI voice models with customizable speaking styles
Maginative: OpenAI Unveils New Audio Models to Make AI Agents Sound More Human Than Ever
www.producthunt.com: OpenAI GPT-4o Audio Models
Analytics Vidhya: OpenAI’s Audio Models: How to Access, Features, Applications, and More

Classification:

HashTags: #OpenAI #AIModels #VoiceAI
Company: OpenAI
Target: Developers
Product: GPT-4o
Feature: Voice Support
Type: AI
Severity: Informative

News from the AI & ML world

DeeperML
