News from the AI & ML world

DeeperML - #openaimodels

Mark Tyson@tomshardware.com //
OpenAI has recently launched its newest reasoning model, o3-pro, making it available to ChatGPT Pro and Team subscribers, as well as through OpenAI’s API. Enterprise and Edu subscribers will gain access the following week. The company touts o3-pro as a significant upgrade, emphasizing its enhanced capabilities in mathematics, science, and coding, and its improved ability to utilize external tools.

OpenAI has also slashed the price of o3 by 80% and o3-pro by 87%, positioning the model as a more accessible option for developers seeking advanced reasoning capabilities. This price adjustment comes at a time when AI providers are competing more aggressively on both performance and affordability. Experts note that evaluations consistently prefer o3-pro over the standard o3 model across all categories, especially in science, programming, and business tasks.

O3-pro utilizes the same underlying architecture as o3, but it’s tuned to be more reliable, especially on complex tasks, with better long-range reasoning. The model supports tools like web browsing, code execution, vision analysis, and memory. While the increased complexity can lead to slower response times, OpenAI suggests that the tradeoff is worthwhile for the most challenging questions "where reliability matters more than speed, and waiting a few minutes is worth the tradeoff.”

Recommended read:
References :
  • Maginative: OpenAI’s new o3-pro model is now available in ChatGPT and the API, offering top-tier performance in math, science, and coding—at a dramatically lower price.
  • AI News | VentureBeat: OpenAI's most powerful reasoning model, o3, is now 80% cheaper, making it more affordable for businesses, researchers, and individual developers.
  • Latent.Space: OpenAI just dropped the price of their o3 model by 80% today and launched o3-pro.
  • THE DECODER: OpenAI has lowered the price of its o3 language model by 80 percent, CEO Sam Altman said.
  • Simon Willison's Weblog: OpenAI's Adam Groth explained that the engineers have optimized inference, allowing a significant price reduction for the o3 model.
  • the-decoder.com: OpenAI lowered the price of its o3 language model by 80 percent, CEO Sam Altman said.
  • AI News | VentureBeat: OpenAI released the latest in its o-series of reasoning model that promises more reliable and accurate responses for enterprises.
  • bsky.app: The OpenAI API is back to running at 100% again, plus we dropped o3 prices by 80% and launched o3-pro - enjoy!
  • Sam Altman: We are past the event horizon; the takeoff has started. Humanity is close to building digital superintelligence, and at least so far it’s much less weird than it seems like it should be.
  • siliconangle.com: OpenAI’s newest reasoning model o3-pro surpasses rivals on multiple benchmarks, but it’s not very fast
  • SiliconANGLE: OpenAI’s newest reasoning model o3-pro surpasses rivals on multiple benchmarks, but it’s not very fast
  • bsky.app: the OpenAI API is back to running at 100% again, plus we dropped o3 prices by 80% and launched o3-pro - enjoy!
  • bsky.app: OpenAI has launched o3-pro. The new model is available to ChatGPT Pro and Team subscribers and in OpenAI’s API now, while Enterprise and Edu subscribers will get access next week. If you use reasoning models like o1 or o3, try o3-pro, which is much smarter and better at using external tools.
  • The Algorithmic Bridge: OpenAI o3-Pro Is So Good That I Can’t Tell How Good It Is

Mark Tyson@tomshardware.com //
References: bsky.app , the-decoder.com , Maginative ...
OpenAI has launched o3-pro, a new and improved version of its AI model designed to provide more reliable and thoughtful responses, especially for complex tasks. Replacing the o1-pro model, o3-pro is accessible to Pro and Team users within ChatGPT and through the API, marking OpenAI's ongoing effort to refine its AI technology. The focus of this upgrade is to enhance the model’s reasoning capabilities and maintain consistency in generating responses, directly addressing shortcomings found in previous models.

The o3-pro model is designed to handle tasks requiring deep analytical thinking and advanced reasoning. While built upon the same transformer architecture and deep learning techniques as other OpenAI chatbots, o3-pro distinguishes itself with an improved ability to understand context. Some users have noted that o3-pro feels like o3, but is only modestly better in exchange for being slower.

Comparisons with other leading models such as Claude 4 Opus and Gemini 2.5 Pro reveal interesting insights. While Claude 4 Opus has been praised for prompt following and understanding user intentions, Gemini 2.5 Pro stands out for its price-to-performance ratio. Early user experiences suggest o3-pro might not always be worth the expense due to its speed, except for research purposes. Some users have suggested that o3-pro hallucinates modestly less, though this is still being debated.

Recommended read:
References :
  • bsky.app: the OpenAI API is back to running at 100% again, plus we dropped o3 prices by 80% and launched o3-pro - enjoy!
  • the-decoder.com: OpenAI cuts o3 model prices by 80% and launches o3-pro today
  • AI News | VentureBeat: OpenAI launches o3-pro AI model, offering increased reliability and tool use for enterprises — while sacrificing speed
  • Maginative: OpenAI’s new o3-pro model is now available in ChatGPT and the API, offering top-tier performance in math, science, and coding—at a dramatically lower price.
  • THE DECODER: OpenAI has lowered the price of its o3 language model by 80 percent, CEO Sam Altman said. The article appeared first on The Decoder.
  • AI News | VentureBeat: OpenAI announces 80% price drop for o3, it’s most powerful reasoning model
  • www.cnbc.com: The figure includes sales from the company’s consumer products, ChatGPT business products and its application programming interface, or API.
  • Latent.Space: OpenAI dropped o3 pricing 80% today and launched o3-pro. Ben Hylak of Raindrop.ai returns with the world's first early review.
  • siliconangle.com: OpenAI’s newest reasoning model o3-pro surpasses rivals on multiple benchmarks, but it’s not very fast
  • SiliconANGLE: Silicon Angle reports on OpenAI’s newest reasoning model o3-pro surpassing rivals.
  • bsky.app: OpenAI has launched o3-pro. The new model is available to ChatGPT Pro and Team subscribers and in OpenAI’s API now, while Enterprise and Edu subscribers will get access next week. If you use reasoning models like o1 or o3, try o3-pro, which is much smarter and better at using external tools.
  • The Algorithmic Bridge: OpenAI o3-Pro Is So Good That I Can’t Tell How Good It Is
  • Windows Report: OpenAI rolls out new o3-pro AI model to ChatGPT Pro subscribers
  • Windows Report: OpenAI Rolls out new o3-pro AI Model to ChatGPT Pro Subscribers
  • www.indiatoday.in: OpenAI introduces O3 PRO for ChatGPT, highlighting significant improvements in performance and cost-efficiency.
  • www.marketingaiinstitute.com: [The AI Show Episode 153]: OpenAI Releases o3-Pro, Disney Sues Midjourney, Altman: “Gentle Singularity†Is Here, AI and Jobs & News Sites Getting Crushed by AI Search
  • datafloq.com: OpenAI is now considered another name of innovation in the field of AI, and with the launch of OpenAI o3, this claim is becoming stronger.
  • composio.dev: OpenAI finally unveiled their much expensive O3, the O3-Pro. The model is available in their API and Pro plan, which costs $200

Chris McKay@Maginative //
OpenAI has unveiled its latest advancements in AI technology with the launch of the GPT-4.1 family of models. This new suite includes GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, all accessible via API, and represents a significant leap forward in coding capabilities, instruction following, and context processing. Notably, these models feature an expanded context window of up to 1 million tokens, enabling them to handle larger codebases and extensive documents. The GPT-4.1 family aims to cater to a wide range of developer needs by offering different performance and cost profiles, with the goal of creating more advanced and efficient AI applications.

These models demonstrate superior results on various benchmarks compared to their predecessors, GPT-4o and GPT-4o mini. Specifically, GPT-4.1 showcases a substantial improvement on the SWE-bench Verified coding test with a 54.6% increase, and a 38.3% increase on Scale’s MultiChallenge for instruction following. Each model is designed with a specific purpose in mind: GPT-4.1 excels in high-level cognitive tasks like software development and research, GPT-4.1 mini offers a balanced performance with reduced latency and cost, while GPT-4.1 nano provides the quickest and most affordable option for tasks such as classification. All three models have knowledge updated through June 2024.

The introduction of the GPT-4.1 family also brings about changes in OpenAI's existing model offerings. The GPT-4.5 Preview model in the API is set to be deprecated on July 14, 2025, due to GPT-4.1 offering comparable or better utility at a lower cost. In terms of pricing, GPT-4.1 is 26% less expensive than GPT-4o for median queries, along with increased prompt caching discounts. Early testers have already noted positive outcomes, with improvements in code review suggestions and data retrieval from large documents. OpenAI emphasizes that many underlying improvements are being integrated into the current GPT-4o version within ChatGPT.

Recommended read:
References :
  • TestingCatalog: OpenAI debuts GPT-4.1 family offering 1M token context window
  • venturebeat.com: OpenAI slashes prices for GPT-4.1, igniting AI price war among tech giants
  • Interconnects: OpenAI's latest models optimizing on intelligence per dollar.
  • THE DECODER: OpenAI launches GPT-4.1: New model family to improve agents, long contexts and coding
  • Simon Willison's Weblog: OpenAI three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out in OpenAI's ).
  • Analytics Vidhya: All About OpenAI’s Latest GPT 4.1 Family
  • pub.towardsai.net: TAI #148: New API Models from OpenAI (4.1) & xAI (grok-3); Exploring Deep Research’s Scaling Laws
  • Towards AI: The GPT-4.1 models, accessible via API, provide a significant advancement in AI capabilities and offer an intriguing alternative for developers looking for high performance at lower cost.
  • Towards AI: TAI #148: New API Models from OpenAI (4.1) & xAI (grok-3); Exploring Deep Research’s Scaling Laws
  • venturebeat.com: OpenAI’s new GPT-4.1 models can process a million tokens and solve coding problems better than ever
  • techstrong.ai: Just days after announcing its plans to retire GPT-4 in ChatGPT, OpenAI on Monday launched a new set of flagship models named GPT-4.1. The release, which The Verge anticipated in an article last week, included the standard version GPT-4.1 model, along with two smaller models — GPT-4.1 mini, and GPT-4.1 nano which OpenAI touts as […]
  • the-decoder.com: OpenAI launches GPT-4.1: New model family to improve agents, long contexts and coding
  • www.tomsguide.com: OpenAI's latest model is here but it isn't GPT-5, it's 4.1, a model all about coding
  • shellypalmer.com: Shelly Palmer discusses the launch of GPT-4.1 and its improved capabilities.
  • felloai.com: OpenAI Quietly Launched GPT‑4.1 – A GPT-4o Successor That’s Crushing Benchmarks
  • thezvi.wordpress.com: The Zvi discusses the mini upgrade from GPT-4.1.
  • bdtechtalks.com: GPT-4.1: OpenAI’s most confusing model
  • Fello AI: OpenAI Quietly Launched GPT‑4.1 – A GPT-4o Successor That’s Crushing Benchmarks
  • www.eweek.com: eWeek reports on the pros and cons of OpenAI's new GPT-4.1 model.
  • Last Week in AI: Last Week in AI discusses the new GPT 4.1 model release by OpenAI
  • Fello AI: OpenAI’s language models have become part of everyday life for millions of people—whether you’re using ChatGPT to get quick answers, brainstorm ideas, or even generate code. With each new version, the models get faster, smarter, and more capable.
  • thezvi.wordpress.com: Yesterday’s news alert, nevertheless: The verdict is in. GPT-4.1-Mini in particular is an excellent practical model, offering strong performance at a good price. The full GPT-4.1 is an upgrade to OpenAI’s more expensive API offerings, it is modestly better but …
  • composio.dev: GPT-4.1 vs. Deepseek v3 vs. Sonnet 3.7 vs. GPT-4.5
  • hackernoon.com: OpenAI announced GPT-4.1, featuring a staggering 1M-token context window and perfect needle-in-a-haystack accuracy.
  • Shelly Palmer: OpenAI has launched GPT-4.1, along with GPT-4.1 Mini and GPT-4.1 Nano. These models are for developers and will not show up in your ChatGPT model picker.
  • eWEEK: OpenAI is releasing new language models, GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano.

Chris McKay@Maginative //
OpenAI has launched a new series of GPT-4.1 models, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. These API-only models are not accessible via the ChatGPT interface but offer significant improvements in coding, instruction following, and context handling. All three models support a massive 1 million token context window, and they have a May 31, 2024 cutoff date.

GPT-4.1 demonstrates enhanced performance in coding benchmarks, surpassing GPT-4o by 21.4% on industry benchmarks. The models are also more cost-effective, with GPT-4.1 being 26% cheaper than GPT-4o and offering better latency. The GPT-4.1 nano model is OpenAI's cheapest model yet, priced at $0.10 per million input tokens and $0.40 per million output tokens. As a result of GPT-4.1's improved performance, OpenAI will be deprecating GPT-4.5 Preview on July 14, 2025.

The GPT-4.1 series excels in several key areas, including coding capabilities and instruction following. The models have achieved impressive scores on benchmarks like SWE-bench Verified and Scale’s MultiChallenge, demonstrating real-world software engineering skills and enhanced adherence to requested formats. Several companies have reported significant improvements in their specialized applications, with GPT-4.1 scoring higher on internal coding benchmarks, providing better code review suggestions, and improving the extraction of granular financial data from complex documents.

Recommended read:
References :
  • Simon Willison's Weblog: Simon Willison reports on three new million token input models from OpenAI, including their cheapest model yet.
  • Maginative: OpenAI has rolled out GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano—faster, cheaper models with sharper coding, better instruction following, and support for 1 million-token context windows.
  • bsky.app: New release of my llm-openai plugin supporting today's three new GPT-4.1 models from OpenAI: llm install -U llm-openai-plugin llm -m openai/gpt-4.1 "Generate an SVG of a pelican riding a bicycle"
  • TestingCatalog: OpenAI debuts GPT-4.1 family offering 1M token context window
  • venturebeat.com: VentureBeat's report on the launch of GPT-4.1.
  • Interconnects: OpenAI's GPT-4.1 and separating the API from ChatGPT
  • the-decoder.com: OpenAI launches GPT-4.1: New model family to improve agents, long contexts and coding
  • THE DECODER: OpenAI launches GPT-4.1: New model family to improve agents, long contexts and coding
  • venturebeat.com: OpenAI’s new GPT-4.1 models can process a million tokens and solve coding problems better than ever
  • Analytics Vidhya: All About OpenAI’s Latest GPT 4.1 Family
  • pub.towardsai.net: TAI #148: New API Models from OpenAI (4.1) & xAI (grok-3); Exploring Deep Research’s Scaling Laws
  • Latent.Space: GPT 4.1: The New OpenAI Workhorse
  • www.tomsguide.com: OpenAI launches another model before GPT 5 — here’s what this one can do
  • techstrong.ai: Details the launch of a new set of flagship models named GPT-4.1 which included the standard version GPT-4.1 model, along with two smaller models.
  • Towards AI: Details the launch of GPT-4.1 models, emphasizing coding and instruction-following capabilities.
  • Towards AI: The GPT-4.1 model series release through Azure AI Foundry represents a major step forward in AI capabilities.
  • techstrong.ai: OpenAI Introduces GPT-4.1 with Improved Coding
  • www.analyticsvidhya.com: OpenAI's new models show improvement in multiple benchmarks, excelling in long-context processing (up to 1 million tokens).
  • thezvi.wordpress.com: GPT-4.1 Is a Mini Upgrade
  • felloai.com: OpenAI has just launched a brand-new series of GPT models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—that promise major advances in coding, instruction following, and the ability to handle incredibly long contexts.
  • shellypalmer.com: While admitting that they "suck at naming their models," OpenAI has launched GPT-4.1, along with GPT-4.1 Mini and GPT-4.1 Nano.
  • thezvi.wordpress.com: GPT-4.1 Is a Mini Upgrade
  • www.analyticsvidhya.com: How to Build Agentic RAG Using GPT-4.1?
  • felloai.com: Ultimate Comparison of GPT-4.1 vs GPT-4o: Which One Should You Use?
  • www.eweek.com: OpenAI released GPT-4.1, the newest successor to its GPT-4o series of AI language models.
  • Fello AI: OpenAI has just launched a brand-new series of GPT models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—that promise major advances in coding, instruction following, and the ability to handle incredibly long contexts.
  • Shelly Palmer: While admitting that they "suck at naming their models," OpenAI has launched GPT-4.1, along with GPT-4.1 Mini and GPT-4.1 Nano. These models are for developers and will not show up in your ChatGPT model picker.
  • eWEEK: OpenAI announced on Monday the release of GPT-4.1, the newest successor to its GPT-4o series of AI language models.
  • bdtechtalks.com: GPT-4.1: OpenAI’s most confusing model
  • composio.dev: GPT-4.1 vs. Deepseek v3 vs. Sonnet 3.7 vs. GPT-4.5
  • thezvi.wordpress.com: OpenAI has upgraded its entire suite of models. By all reports, they are back in the game for more than images. GPT-4.1 and especially GPT-4.1-mini are their new API non-reasoning models. All reports are that GPT-4.1-mini especially is very good.
  • thezvi.wordpress.com: Greg Brockman (OpenAI): Just released o3 and o4-mini! These models feel incredibly smart.
  • Last Week in AI: Analyzes OpenAI's new AI models, focusing on their enhanced coding capabilities and concerns about reduced resources for safety testing.
  • techcrunch.com: OpenAI's new GPT-4.1 AI models focus on coding, Google’s newest Gemini AI model focuses on efficiency, and more!
  • TheSequence: The Sequence Radar #526: The OpenAI Blitz: From GPT-4.1 to Windsurf