News from the AI & ML world

DeeperML - #efficiency

Michal Langmajer@Fello AI //
OpenAI has announced the release of GPT-4.5, its latest language model which they are calling their 'last non-chain-of-thought model.' According to OpenAI, GPT-4.5 offers substantial enhancements over its predecessors, particularly in advanced reasoning, problem-solving, and contextual understanding. Sam Altman, CEO of OpenAI, described it as the "first model that feels like talking to a thoughtful person," noting moments of astonishment at the quality of advice received from the AI.

However, the rollout is facing challenges due to GPU shortages. Altman stated they are "out of GPUs," leading to a staggered release, initially limited to ChatGPT Pro subscribers who pay $200 a month. While GPT-4.5 is available to developers across all paid API tiers, OpenAI plans to expand access to Plus and Team tiers next week, with tens of thousands of GPUs expected to arrive to alleviate the supply constraints. Despite not being a reasoning model, OpenAI estimates that GPT-4.5 is 30 times more expensive to run than GPT-4o.

Recommended read:
References :
  • Fello AI: OpenAI’s GPT‑4.5 Finally Arrived: Can It Beat Grok 3 and Claude 3.7?
  • Shelly Palmer: Shelly Palmer discusses the release of OpenAI's GPT-4.5.
  • Analytics Vidhya: Everything You Need to Know About OpenAI’s GPT-4.5
  • www.tomshardware.com: Tom's Hardware reports on Sam Altman's statement about GPU shortages delaying the GPT-4.5 release.
  • venturebeat.com: VentureBeat reports OpenAI releases GPT-4.5 claiming 10X efficiency over GPT-4, but says it’s ‘not a frontier model’
  • Gradient Flow: Scaling Up, Costs Up: GPT-4.5 and the Intensifying AI Competition
  • Pivot to AI: OpenAI releases GPT-4.5 with ridiculous prices for a mediocre model
  • Techstrong.ai: TechStrong.ai article on OpenAI's GPT-4.5 AI model.
  • THE DECODER: OpenAI has released GPT-4.5 as a "Research Preview".
  • eWEEK: OpenAI releases GPT-4.5, a “Warm” Generative AI Model, for Paid Plans and APIs
  • www.windowscentral.com: Sam Altman on GPT-4.5: Expensive, yet the closest thing to a thoughtful conversational partner we've seen
  • THE DECODER: OpenAI's largest model GPT-4.5 delivers on vibes instead of benchmarks
  • 9to5Mac: OpenAI announces GPT-4.5, ChatGPT’s largest and best model for chat
  • www.engadget.com: OpenAI's new GPT-4.5 model is a better, more natural conversationalist
  • The Verge: Anthropic’s new ‘hybrid reasoning’ AI model is its smartest yet
  • THE DECODER: OpenAI has presented its largest language model to date. According to Mark Chen, Chief Research Officer at OpenAI, GPT 4.5 shows that the scaling of AI models has not yet reached its limits.
  • The Verge: OpenAI is launching GPT-4.5 today, its newest and largest AI language model.
  • NextBigFuture.com: OpenAI GPT 4.5 Has BIG Coding Improvement – Claims Scaling Still Works – Expensive
  • TechCrunch: OpenAI unveils GPT-4.5 ‘Orion,’ its largest AI model yet
  • PCMag Middle East ai: Reporting that OpenAI has launched GPT-4.5 but is limiting it to priciest tiers due to GPU shortages.
  • Analytics Vidhya: Two days ago, on 27 Feb 2025, OpenAI dropped GPT-4.5, expectations were sky-high. But instead of a groundbreaking leap forward, we got a model prioritizing emotional intelligence over raw reasoning power.
  • AI News | VentureBeat: GPT-4.5 for enterprise: Do its accuracy and knowledge justify the cost?
  • Windows Report: OpenAI released GPT-4.5, but it’s not much of an upgrade from GPT-4o. After DeepSeek was unleashed into the world, everyone wondered what OpenAI would do now that another AI company had developed an extremely powerful model at a tiny fraction of the budget.
  • iHLS: OpenAI Unveils GPT-4.5
  • Data Phoenix: OpenAI releases the long-awaited GPT-4.5/Orion, its last non-chain-of-thought model
  • Towards AI: GPT-4.5: The Next Evolution in AI
  • Towards AI: Towards AI article on TAI #142: GPT-4.5 Released.
  • www.marketingaiinstitute.com: [The AI Show Episode 138]: Introducing GPT-4.5, Claude 3.7 Sonnet, Alexa+, Deep Research Now in ChatGPT Plus & How AI Is Disrupting Writing
  • Analytics Vidhya: Now, this is a shocker, despite a lot of backlash on the cost of GPT 4.5, it becomes #1 in the Chatbot Arena LLM Leaderboard! Securing over 3,200+ votes, OpenAI’s latest model has emerged as number one across all evaluation categories, prominently excelling in Style Control and Multi-Turn interactions.

Michal Langmajer@Fello AI //
OpenAI has launched its latest AI model, GPT-4.5, described as the company's most advanced language model to date. This new model features substantial enhancements over its predecessors, particularly in advanced reasoning, problem-solving, and contextual understanding. GPT-4.5 is designed to offer a more natural and engaging conversational experience, with improvements including superior capabilities in handling complex reasoning tasks, enhanced creativity, and the ability to manage intricate logic problems while maintaining nuanced conversations with improved contextual recall.

However, the launch of GPT-4.5 is facing challenges due to a shortage of GPUs, according to OpenAI CEO Sam Altman. This limitation is restricting access to the priciest tiers of ChatGPT Pro subscribers and developers initially. Altman stated that OpenAI has "run out of GPUs" due to growing demand, leading to a staggered rollout. The company plans to add tens of thousands of GPUs next week and expand access to Plus, Team, Enterprise, and Edu users in the following weeks.

Recommended read:
References :
  • AI News | VentureBeat: OpenAI has announced the release of GPT-4.5, which CEO Sam Altman previously said would be the last non-chain-of-thought (CoT) model. The company said the new model “is not a frontier modelâ€� but is still its biggest large language model (LLM), with more computational efficiency.
  • Analytics Vidhya: Since the beginning of 2025, we have been seeing the launch of one amazing model after another – from DeepSeek-R1 and o3-mini to Grok 3 and Claude 3.7 Sonnet. The latest addition to this ever-expanding list of advanced AI models is the much-awaited OpenAI GPT-4.5. This new model in the GPT series brings “Vibe Checkâ€� ...
  • Fello AI: OpenAI’s GPT‑4.5 Finally Arrived: Can It Beat Grok 3 and Claude 3.7?
  • Shelly Palmer: GPT-4.5: The Last LLM
  • www.tomshardware.com: Sam Altman says that OpenAI has to stagger the release of GPT-4.5 due to GPU shortages.
  • Techstrong.ai: OpenAI’s GPT‑4.5 AI Is Ready for ‘Natural Conversation’
  • eWEEK: OpenAI Releases GPT-4.5, a “Warmâ€� Generative AI Model, for Paid Plans and APIs
  • Gradient Flow: Scaling Up, Costs Up: GPT-4.5 and the Intensifying AI Competition
  • THE DECODER: OpenAI has presented its largest language model to date. According to Mark Chen, Chief Research Officer at OpenAI, GPT 4.5 shows that the scaling of AI models has not yet reached its limits.
  • PCMag Middle East ai: OpenAI unveiled a new AI model today, GPT-4.5, but the launch did not go as planned. The company ran out of GPUs, or computing power, ahead of the reveal, CEO Sam Altman.
  • Analytics Vidhya: OpenAI has introduced GPT-4.5, an advanced AI model designed to enhance chatbot interactions with improved natural language processing, a richer knowledge base, and better contextual understanding.
  • THE DECODER: OpenAI has released GPT-4.5 as a "Research Preview". The new language model is intended to be more natural and less hallucinatory, but is significantly more expensive than its predecessors.
  • AI News | VentureBeat: OpenAI has announced the release of GPT-4.5, a research preview of its latest and most powerful large language model (LLM) for chat applications
  • Analytics Vidhya: Two days ago, on 27 Feb 2025, OpenAI dropped GPT-4.5, expectations were sky-high. But instead of a groundbreaking leap forward, we got a model prioritizing emotional intelligence over raw reasoning power.
  • AI News | VentureBeat: GPT-4.5 for enterprise: Do its accuracy and knowledge justify the cost?
  • TechCrunch: OpenAI launches GPT-4.5, its largest model to date
  • Windows Report: OpenAI released GPT-4.5, but it’s not much of an upgrade from GPT-4o
  • Analytics Vidhya: The article discusses GPT-4.5 becoming #1 on the Chatbot Arena.
  • Data Phoenix: This article discusses OpenAI's release of GPT-4.5, its strengths and weaknesses.
  • iHLS: This article covers the launch of GPT-4.5 and its capabilities.
  • THE DECODER: The article discusses OpenAI's GPT-4.5 release and its performance compared to previous versions.
  • Towards AI: TAT #142: GPT-4.5 Released — But Can It Stack Up Against Reasoning Models?
  • pub.towardsai.net: TAT #142: GPT-4.5 Released — But Can It Stack Up Against Reasoning Models?
  • LessWrong: On GPT-4.5
  • Analytics Vidhya: Top 5 Generative AI Breakthroughs of February 2025: GPT-4.5, Grok-3, and More!
  • Towards AI: GPT-4.5: The Next Evolution in AI

Tris Warkentin@The Official Google Blog //
Google AI has released Gemma 3, a new family of open-source AI models designed for efficient and on-device AI applications. Gemma 3 models are built with technology similar to Gemini 2.0, intended to run efficiently on a single GPU or TPU. The models are available in various sizes: 1B, 4B, 12B, and 27B parameters, with options for both pre-trained and instruction-tuned variants, allowing users to select the model that best fits their hardware and specific application needs.

Gemma 3 offers practical advantages including efficiency and portability. For example, the 27B version has demonstrated robust performance in evaluations while still being capable of running on a single GPU. The 4B, 12B, and 27B models are capable of processing both text and images, and supports more than 140 languages. The models have a context window of 128,000 tokens, making them well suited for tasks that require processing large amounts of information. Google has built safety protocols into Gemma 3, including a safety checker for images called ShieldGemma 2.

Recommended read:
References :
  • MarkTechPost: Google AI Releases Gemma 3: Lightweight Multimodal Open Models for Efficient and On‑Device AI
  • The Official Google Blog: Introducing Gemma 3: The most capable model you can run on a single GPU or TPU
  • AI News | VentureBeat: Google unveils open source Gemma 3 model with 128k context window
  • AI News: Details on the launch of Gemma 3 open AI models by Google.
  • The Verge: Google calls Gemma 3 the most powerful AI model you can run on one GPU
  • Maginative: Google DeepMind’s Gemma 3 Brings Multimodal AI, 128K Context Window, and More
  • TestingCatalog: Gemma 3 sets new benchmarks for open compact models with top score on LMarena
  • AI & Machine Learning: Announcing Gemma 3 on Vertex AI
  • Analytics Vidhya: Gemma 3 vs DeepSeek-R1: Is Google’s New 27B Model a Tough Competition to the 671B Giant?
  • AI & Machine Learning: How to deploy serverless AI with Gemma 3 on Cloud Run
  • The Tech Portal: Google rolls outs Gemma 3, its latest collection of lightweight AI models
  • eWEEK: Google’s Gemma 3: Does the ‘World’s Best Single-Accelerator Model’ Outperform DeepSeek-V3?
  • The Tech Basic: Gemma 3 by Google: Multilingual AI with Image and Video Analysis
  • Analytics Vidhya: Google’s Gemma 3: Features, Benchmarks, Performance and Implementation
  • www.infoworld.com: Google unveils Gemma 3 multi-modal AI models
  • www.zdnet.com: Google claims Gemma 3 reaches 98% of DeepSeek's accuracy - using only one GPU
  • AIwire: Google unveiled open source Gemma 3, is multimodal, comes in four sizes and can now handle more information and instructions thanks to a larger context window. The post appeared first on .
  • Ars OpenForum: Google’s new Gemma 3 AI model is optimized to run on a single GPU
  • THE DECODER: Google DeepMind has unveiled Gemma 3, a new generation of open AI models designed to deliver high performance with a relatively small footprint, making them suitable for running on individual GPUs or TPUs.
  • Gradient Flow: Gemma 3: What You Need To Know
  • Interconnects: Gemma 3, OLMo 2 32B, and the growing potential of open-source AI
  • OODAloop: Gemma 3, Google's newest lightweight, open-source AI model, is designed for multimodal tasks and efficient deployment on various devices.
  • NVIDIA Technical Blog: Google has released lightweight, multimodal, multilingual models called Gemma 3. The models are designed to run efficiently on phones and laptops.
  • LessWrong: Google DeepMind has unveiled Gemma 3, a new generation of open AI models designed to deliver high performance with a relatively small footprint, making them suitable for running on individual GPUs or TPUs.

Koray Kavukcuoglu@The Official Google Blog //
Google has unveiled Gemini 2.5 Pro, touted as its "most intelligent model to date," enhancing AI reasoning and workflow capabilities. This multimodal model, available to Gemini Advanced users and experimentally on Google’s AI Studio, outperforms competitors like OpenAI, Anthropic, and DeepSeek on key benchmarks, particularly in coding, math, and science. Gemini 2.5 Pro boasts an impressive 1 million token context window, soon expanding to 2 million, enabling it to handle larger datasets and understand entire code repositories.

Gemini 2.5 Pro excels in advanced reasoning benchmark tests, achieving a state-of-the-art score on datasets designed to capture human knowledge and reasoning. Its enhanced coding performance allows for the creation of visually compelling web apps and agentic code applications, along with code transformation and editing. Google plans to release pricing for Gemini 2.5 models soon, marking a significant step in their goal of developing more capable and context-aware AI agents.

Recommended read:
References :

Eira May@Stack Overflow Blog //
AI agents are rapidly transforming business operations across various sectors, promising to automate tasks, enhance efficiency, and streamline workflows. Companies are integrating these intelligent systems to modernize customer experiences and unlock enterprise value. To fully leverage the potential of AI agents, businesses need to ensure they have real-time and seamless connections to company databases, internal communication tools, and documents. This integration is crucial for the agents to provide contextually aware and valuable assistance.

Saltbox Mgmt, a Salesforce consulting company, has successfully implemented Agentforce to modernize the buying experience, resulting in improved efficiency and enhanced personalization. Moreover, the integration of AI in real estate technology presents opportunities for strategic transformation, boosting efficiency, value, and decision-making capabilities. However, AI assistants are only as effective as the knowledge base they are connected to, highlighting the importance of comprehensive and up-to-date internal data.

Recommended read:
References :
  • bdtechtalks.com: bdtechtalks.com - Why corporate real estate should adopt AI to unlock enterprise value
  • Stack Overflow Blog: stackoverflow.blog - “Are AI agents ready for the enterprise?”
  • AI Accelerator Institute: aiacceleratorinstitute.com - AI assistants: Only as smart as your knowledge base
  • Salesforce: salesforce.com - Saltbox Mgmt Modernizes the Buying Experience with Agentforce, Boosting Efficiency and Personalization
  • Bernard Marr: AI Agents Are Coming For Your Job Tasks—Here's How To Stay Ahead

matthewthomas@Microsoft Industry Blogs //
References: Source , The Quantum Insider
Microsoft is emphasizing both AI security and advancements in quantum computing. The company is integrating AI features across its products and services, including Microsoft 365, while also highlighting the critical intersection of AI innovation and security. Microsoft will host Microsoft Secure on April 9th, an online event designed to help professionals discover AI innovations for the security lifecycle. Attendees can learn how to harden their defenses, secure AI investments, and discover AI-first tools and best practices.

Microsoft is also continuing its work in quantum computing, recently defending its topological qubit claims at the American Physical Society (APS) meeting. While Microsoft maintains confidence in its results, skepticism remains within the scientific community regarding the verification methods used, particularly the reliability of the topological gap protocol (TGP) in detecting Majorana quasiparticles. Chetan Nayak, a leading theoretical physicist at Microsoft, presented the company’s findings, acknowledging the skepticism but insisting that the team is confident.

Recommended read:
References :
  • Source: AI innovation requires AI security: Hear what’s new at Microsoft Secure
  • The Quantum Insider: Microsoft defends topological qubit claims at APS Meeting Amid Skepticism

Matthew S.@IEEE Spectrum //
Recent research indicates that AI models, particularly large language models (LLMs), can struggle with overthinking and analysis paralysis, impacting their efficiency and success rates. A study has found that reasoning LLMs sometimes overthink problems, which leads to increased computational costs and a reduction in their overall performance. This issue is being addressed through various optimization techniques, including scaling inference-time compute, reinforcement learning, and supervised fine-tuning, to ensure models use only the necessary amount of reasoning for tasks.

The size and training methods of these models play a crucial role in their reasoning abilities. For instance, Alibaba's Qwen team introduced QwQ-32B, a 32-billion-parameter model that outperforms much larger rivals in key problem-solving tasks. QwQ-32B achieves superior performance in math, coding, and scientific reasoning using multi-stage reinforcement learning, despite being significantly smaller than DeepSeek-R1. This advancement highlights the potential of reinforcement learning to unlock reasoning capabilities in smaller models, rivaling the performance of giant models while requiring less computational power.

Recommended read:
References :
  • IEEE Spectrum: It’s Not Just Us: AI Models Struggle With Overthinking
  • Sebastian Raschka, PhD: This article explores recent research advancements in reasoning-optimized LLMs, with a particular focus on inference-time compute scaling that have emerged since the release of DeepSeek R1.

@www.microsoft.com //
Microsoft is pushing forward on multiple fronts in the realm of Artificial Intelligence and Cloud Technology, with recent advancements impacting both user experience and business ROI. One significant development focuses on enhancing video quality within Microsoft Teams through the introduction of Super Resolution (SR), now available in public preview and slated for general availability in March. This feature, initially for Snapdragon X-based Copilot+ PCs, leverages AI to restore video resolution during Teams calls, particularly when network bandwidth is limited and lower-resolution videos are transmitted.

This AI-driven approach significantly improves visual clarity compared to traditional upscaling methods, with user assessments indicating an average increase of +0.6 CMOS in quality. Furthermore, Microsoft is making strides in deploying Large Language Models (LLMs) on edge devices like smartphones and laptops through advances in low-bit quantization. These innovations, including techniques like T-MAC, Ladder, and LUT Tensor Core, improve computational efficiency and hardware compatibility, enabling more efficient operation of LLMs on resource-constrained devices. Microsoft also highlights how real-world businesses are transforming using AI, showcasing 50 new customer stories.

Recommended read:
References :