News from the AI & ML world

DeeperML

Megan Crouse@techrepublic.com //
OpenAI has released new audio models alongside o1 Pro, a reasoning model priced well above the rest of its lineup. The audio models, which include gpt-4o-transcribe and gpt-4o-mini-transcribe, offer better transcription than Whisper, but because they are built on language models they are susceptible to prompt injection attacks. The models are available through the Realtime API, which supports real-time transcription of microphone input from a standalone Python script.
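As a rough illustration of how one of these models might be called, the sketch below sends a single audio file for transcription. It is not the Realtime API microphone script mentioned above; it swaps in the simpler file-based transcription endpoint of the official `openai` Python package. The helper name `transcribe_file` is hypothetical, and `client` is assumed to be an `openai.OpenAI()` instance with a valid API key.

```python
def transcribe_file(client, path, model="gpt-4o-transcribe"):
    """Send one audio file for transcription and return the text.

    Assumption: `client` is an `openai.OpenAI()` instance. This uses the
    file-based transcription endpoint, not the Realtime API.
    """
    with open(path, "rb") as audio:
        # The file is uploaded as-is; the response carries the transcript
        # in its `text` attribute.
        result = client.audio.transcriptions.create(model=model, file=audio)
    return result.text
```

With the `openai` package installed, a call might look like `transcribe_file(OpenAI(), "meeting.wav")`, where the file name is a placeholder. Because the model is built on a language model, the returned text should be treated as untrusted input.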

OpenAI's o1 Pro costs $150 per million input tokens and $600 per million output tokens. That is ten times the price of the standard o1 model and twice the input price of GPT-4.5. OpenAI claims o1 Pro "thinks harder" and delivers superior responses on complex reasoning tasks, but early benchmarks suggest only incremental improvements. Access is currently limited to developers who have spent at least $5 on OpenAI's API services, and the model is aimed at developers building AI agents and automation tools.
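To make the per-token pricing concrete, here is a minimal cost sketch. The o1 Pro prices are the ones quoted above; the standard o1 prices are not stated directly and are derived here from the "ten times" comparison. The dictionary keys and the function name `request_cost` are illustrative labels, not official identifiers.

```python
# Prices in USD per million tokens.
# "o1-pro" figures are quoted in the article; "o1" figures are an
# assumption derived from the article's "ten times" comparison.
PRICES = {
    "o1-pro": {"input": 150.0, "output": 600.0},
    "o1": {"input": 15.0, "output": 60.0},
}


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request at per-million-token pricing."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.2f}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to $2.70 on o1 Pro and $0.27 on standard o1.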



References:
  • Fello AI: OpenAI Just Dropped Its Most Expensive AI Model Yet, And It Costs a Fortune
  • www.techrepublic.com: OpenAI Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails
  • AI News | VentureBeat: Describes OpenAI’s new voice AI model gpt-4o-transcribe and its ability to add speech to existing text apps.
  • MarkTechPost: Explains the release of advanced audio models gpt-4o-mini-tts, gpt-4o-transcribe, and gpt-4o-mini-transcribe by OpenAI.
  • THE DECODER: OpenAI releases new AI voice models with customizable speaking styles
  • Maginative: OpenAI Unveils New Audio Models to Make AI Agents Sound More Human Than Ever
  • www.producthunt.com: OpenAI GPT-4o Audio Models
  • Analytics Vidhya: OpenAI’s Audio Models: How to Access, Features, Applications, and More