News from the AI & ML world

DeeperML - #gpt-5

Tripty@techvro.com //
OpenAI is making significant strides in artificial intelligence, announcing plans to merge its diverse ChatGPT models into a unified and more powerful GPT-5 system. This strategic consolidation aims to simplify user experience, eliminating the need for users to navigate between various models like GPT-4o or Codex-1. VP Jerry Tworek confirmed that GPT-5 will enhance existing capabilities with less model switching, streamlining tasks like code generation, reasoning, web search, and RAG (Retrieval-Augmented Generation) into a single API call. The integration will also encompass tools like Codex and Operator, creating a seamless and intuitive AI interaction for developers and users alike.

The company is not only focusing on software advancements but also venturing into hardware. OpenAI has acquired "io," a startup founded by former Apple design guru Jony Ive and OpenAI CEO Sam Altman, for a staggering $6.5 billion. This acquisition signals OpenAI's commitment to developing AI-driven hardware, envisioning a device that could redefine human-computer interaction. While specific details about the device remain under wraps, leaked information suggests it won't be a smartphone or glasses but a small, pocket-sized device aware of its user's surroundings.

The collaboration between OpenAI and Jony Ive aims to revolutionize how we engage with AI, moving beyond traditional screens and interfaces. Ive, known for his minimalist design philosophy, envisions a "new design movement" centered on creating subtle, useful tools that seamlessly integrate into everyday life. Altman expressed that current technology falls short of realizing the full potential of AI, prompting the creation of a device that acts as a "central facet of using OpenAI." The ambition is clear: to ship 100 million AI "companions," potentially through a subscription model where users receive updated hardware, marking a bold step toward an AI-native computing future.

Recommended read:
References :

@www.marktechpost.com //
OpenAI has launched the Evals API, a new tool designed to streamline the evaluation of large language models (LLMs) for developers and teams. The Evals API introduces programmatic evaluation capabilities, allowing developers to define tests, automate evaluation runs, and iterate on prompts directly from their workflows. Previously, evaluations were accessible only through the OpenAI dashboard, but the new API enables a more integrated and systematic approach to assessing model performance.

The Evals API aims to address the challenges of manually evaluating LLM performance, which can be time-consuming, especially when scaling applications across diverse domains. By providing a systematic approach, OpenAI hopes to improve custom test case assessments, measure improvements across prompt iterations, and automate quality assurance in development pipelines. This will enable developers to treat evaluation as a core part of their development cycle, similar to unit tests in traditional software engineering.

Despite these advancements, OpenAI has announced a delay in the launch of GPT-5 by a few months. According to CEO Sam Altman, the delay is due to the company's efforts to significantly improve the model and ensure enough capacity to support expected high demand. In the meantime, OpenAI plans to release o3 and o4-mini models. The company has also faced capacity issues with current features, as seen with the restrictions placed on the image generation software after its launch.

Recommended read:
References :
  • www.marktechpost.com: In a significant move to empower developers and teams working with large language models (LLMs), OpenAI has introduced the Evals API, a new toolset that brings programmatic evaluation capabilities to the forefront. While evaluations were previously accessible via the OpenAI dashboard, the new API allows developers to define tests, automate evaluation runs, and iterate on […] The post appeared first on .
  • www.tomsguide.com: While you'll have to wait a bit longer for the new model, OpenAI does have a couple of exciting announcements
  • the-decoder.com: OpenAI has introduced an Evals API that enables programmatic test creation and automation.
  • THE DECODER: OpenAI releases Evals API for systematic prompt testing

@Latest from Tom's Guide //
OpenAI has announced a shift in its release strategy for GPT-5, delaying the launch by several months. CEO Sam Altman revealed the change in plans, stating that the company will first release its reasoning models, o3 and o4-mini, in the coming weeks. This reverses earlier plans to integrate these models directly into GPT-5. The full GPT-5 model is now expected to arrive "in a few months."

Altman cited several reasons for the delay. Integrating all components into a single unified system proved more challenging than initially anticipated. Additionally, the extra development time has revealed the potential to make GPT-5 "much better than we originally thought." Ensuring sufficient computing capacity to meet the expected "unprecedented demand" was also a key factor. The o3 model, in particular, has undergone significant improvements since its internal preview, with Altman stating that "people will be happy" with the advancements.

The o3 and o4-mini models are classified as reasoning models, designed to perform complex thinking tasks. These models have demonstrated stronger performance than conventional language models in areas like coding and mathematics. OpenAI first introduced its o3 model in December 2024, marking a major advancement in complex reasoning tasks, followed by the more affordable and faster o3-mini version in late January 2025. While users will need to wait longer for GPT-5, the upcoming release of o3 and o4-mini promises exciting advancements in AI capabilities.

Recommended read:
References :
  • Simon Willison's Weblog: change of plans: we are going to release o3 and o4-mini after all, probably in a couple of weeks, and then do GPT-5 in a few months — Tags: , , , ,
  • THE DECODER: OpenAI is revising its release strategy for GPT-5. It is now delayed by several months.
  • www.techradar.com: ChatGPT-5 is delayed, but o3 and o4 mini LLMs to be released in a couple of weeks.
  • www.tomsguide.com: While you'll have to wait a bit longer for the new model, OpenAI does have a couple of exciting announcements

Chris McKay@Maginative //
OpenAI is shaking up its AI model release strategy, announcing plans to launch o3 and o4-mini in the coming weeks before the much-anticipated GPT-5. This marks a reversal from earlier plans to consolidate efforts around GPT-5. CEO Sam Altman cited technical integration challenges and the need for sufficient capacity to handle expected demand as factors influencing the decision. Altman expressed confidence that the delay will allow OpenAI to make GPT-5 "much better than we originally thought," promising substantial improvements to the flagship model.

The unexpected addition of o4-mini indicates that OpenAI isn't slowing down its pace of innovation. The release of o3 and o4-mini comes as OpenAI faces increasing competition in the AI market. In a strategic move targeting the education sector, OpenAI is now offering free ChatGPT Plus subscriptions to college students. This initiative aims to escalate competition with Anthropic, particularly following Anthropic's unveiling of "Claude for Education" and partnerships with several universities.

In addition to model development, OpenAI is reportedly finalizing a significant funding deal, potentially worth $40 billion, with SoftBank. The funds are intended to further advance the capabilities of the models and address any safety concerns. The influx of capital could solidify OpenAI's position as a leading force in the rapidly evolving AI landscape, enabling them to pursue ambitious research and development projects while navigating the competitive pressures from rivals like Google and Anthropic.

Recommended read:
References :
  • Maginative: OpenAI Reshuffles Roadmap: Two New Models to Precede "Much Better" GPT-5
  • The Algorithmic Bridge: GPT-5, o3, and o4-mini Are Coming Soon
  • Simon Willison's Weblog: change of plans: we are going to release o3 and o4-mini after all, probably in a couple of weeks, and then do GPT-5 in a few months
  • www.techradar.com: ChatGPT-5 is on hold as OpenAI changes plans and releases new o3 and o4-mini models