News from the AI & ML world

DeeperML - #gpu

Michal Langmajer@Fello AI //
OpenAI has announced the release of GPT-4.5, its latest language model which they are calling their 'last non-chain-of-thought model.' According to OpenAI, GPT-4.5 offers substantial enhancements over its predecessors, particularly in advanced reasoning, problem-solving, and contextual understanding. Sam Altman, CEO of OpenAI, described it as the "first model that feels like talking to a thoughtful person," noting moments of astonishment at the quality of advice received from the AI.

However, the rollout is facing challenges due to GPU shortages. Altman stated they are "out of GPUs," leading to a staggered release, initially limited to ChatGPT Pro subscribers who pay $200 a month. While GPT-4.5 is available to developers across all paid API tiers, OpenAI plans to expand access to Plus and Team tiers next week, with tens of thousands of GPUs expected to arrive to alleviate the supply constraints. Despite not being a reasoning model, OpenAI estimates that GPT-4.5 is 30 times more expensive to run than GPT-4o.

Recommended read:
References :
  • Fello AI: OpenAI’s GPT‑4.5 Finally Arrived: Can It Beat Grok 3 and Claude 3.7?
  • Shelly Palmer: Shelly Palmer discusses the release of OpenAI's GPT-4.5.
  • Analytics Vidhya: Everything You Need to Know About OpenAI’s GPT-4.5
  • www.tomshardware.com: Tom's Hardware reports on Sam Altman's statement about GPU shortages delaying the GPT-4.5 release.
  • venturebeat.com: VentureBeat reports OpenAI releases GPT-4.5 claiming 10X efficiency over GPT-4, but says it’s ‘not a frontier model’
  • Gradient Flow: Scaling Up, Costs Up: GPT-4.5 and the Intensifying AI Competition
  • Pivot to AI: OpenAI releases GPT-4.5 with ridiculous prices for a mediocre model
  • Techstrong.ai: TechStrong.ai article on OpenAI's GPT-4.5 AI model.
  • THE DECODER: OpenAI has released GPT-4.5 as a "Research Preview".
  • eWEEK: OpenAI releases GPT-4.5, a “Warm” Generative AI Model, for Paid Plans and APIs
  • www.windowscentral.com: Sam Altman on GPT-4.5: Expensive, yet the closest thing to a thoughtful conversational partner we've seen
  • THE DECODER: OpenAI's largest model GPT-4.5 delivers on vibes instead of benchmarks
  • 9to5Mac: OpenAI announces GPT-4.5, ChatGPT’s largest and best model for chat
  • www.engadget.com: OpenAI's new GPT-4.5 model is a better, more natural conversationalist
  • The Verge: Anthropic’s new ‘hybrid reasoning’ AI model is its smartest yet
  • THE DECODER: OpenAI has presented its largest language model to date. According to Mark Chen, Chief Research Officer at OpenAI, GPT 4.5 shows that the scaling of AI models has not yet reached its limits.
  • The Verge: OpenAI is launching GPT-4.5 today, its newest and largest AI language model.
  • NextBigFuture.com: OpenAI GPT 4.5 Has BIG Coding Improvement – Claims Scaling Still Works – Expensive
  • TechCrunch: OpenAI unveils GPT-4.5 ‘Orion,’ its largest AI model yet
  • PCMag Middle East ai: Reporting that OpenAI has launched GPT-4.5 but is limiting it to priciest tiers due to GPU shortages.
  • Analytics Vidhya: Two days ago, on 27 Feb 2025, OpenAI dropped GPT-4.5, expectations were sky-high. But instead of a groundbreaking leap forward, we got a model prioritizing emotional intelligence over raw reasoning power.
  • AI News | VentureBeat: GPT-4.5 for enterprise: Do its accuracy and knowledge justify the cost?
  • Windows Report: OpenAI released GPT-4.5, but it’s not much of an upgrade from GPT-4o. After DeepSeek was unleashed into the world, everyone wondered what OpenAI would do now that another AI company had developed an extremely powerful model at a tiny fraction of the budget.
  • iHLS: OpenAI Unveils GPT-4.5
  • Data Phoenix: OpenAI releases the long-awaited GPT-4.5/Orion, its last non-chain-of-thought model
  • Towards AI: GPT-4.5: The Next Evolution in AI
  • Towards AI: Towards AI article on TAI #142: GPT-4.5 Released.
  • www.marketingaiinstitute.com: [The AI Show Episode 138]: Introducing GPT-4.5, Claude 3.7 Sonnet, Alexa+, Deep Research Now in ChatGPT Plus & How AI Is Disrupting Writing
  • Analytics Vidhya: Now, this is a shocker, despite a lot of backlash on the cost of GPT 4.5, it becomes #1 in the Chatbot Arena LLM Leaderboard! Securing over 3,200+ votes, OpenAI’s latest model has emerged as number one across all evaluation categories, prominently excelling in Style Control and Multi-Turn interactions.

Allyson Vasquez@NVIDIA Technical Blog //
References: Data Phoenix , AIwire , TechCrunch ...
NVIDIA's GTC 2025 is shaping up to be a major event for AI enthusiasts, packed with networking opportunities, live demos, and discussions on the latest AI innovations. Data Phoenix is highlighting the event as a key gathering, featuring meetups, networking receptions, and hands-on sessions alongside the main conference. They are also co-hosting and supporting key events like the INFRA@GTC Networking Reception and AI Demo Jam.

VAST Data plans to showcase its data platform for enterprise Retrieval Augmented Generation (RAG) use cases at the conference. Microsoft and NVIDIA have also announced a partnership to integrate RTX Neural Shaders into a DirectX preview in April, bringing more AI capabilities to game development. This integration will allow developers to leverage Tensor cores in RTX GPUs to accelerate neural networks within a game's graphics pipeline.

Recommended read:
References :
  • Data Phoenix: DATAPHOENIX: NVIDIA GTC 2025 & Top AI Events of the Week! (March 17-23)
  • AIwire: VAST Fleshes Out Data Platform for Enterprise RAG Use Cases
  • BigDATAwire: NVIDIA GTC 2025: What to Expect From the Ultimate AI Event?
  • TechCrunch: Nvidia GTC 2025: What to expect from this year’s show
  • AIwire: Provides an overview of what to expect from NVIDIA's GTC 2025 event.
  • www.tomsguide.com: Nvidia CEO Jensen Huang is taking center stage in the Nvidia GTC 2025 Keynote to show off what's next in AI — here's the latest news.
  • insidehpc.com: SAN JOSE, Calif. – March 18, 2025– Data infrastructure company NetApp (NASDAQ: NTAP) today made an agentic AI announcement that taps the NVIDIA AI Data Platform reference design.Â
  • AIwire: At its GTC event in San Jose today, Nvidia unveiled updates to its AI infrastructure portfolio, including its next-generation datacenter GPU, the NVIDIA Blackwell Ultra. Expanding on the Blackwell architecture introduced last year, Nvidia is integrating its new 300-series GPUs into two DGX systems: the NVIDIA DGX GB300 and the NVIDIA DGX B300.
  • www.tomshardware.com: Nvidia announces Blackwell Ultra B300 —1.5X faster than B200 with 288GB HBM3e and 15 PFLOPS dense FP4
  • www.laptopmag.com: Nvidia's Jensen Huang says new Blackwell chips make previous-gen feel obsolete
  • NVIDIA Newsroom: Full Steam Ahead: NVIDIA-Certified Program Expands to Enterprise Storage for Faster AI Factory Deployment
  • BigDATAwire: Nvidia unveils updates to its AI infrastructure portfolio, including its next-generation datacenter GPU, the NVIDIA Blackwell Ultra.
  • venturebeat.com: Nvidia’s GTC 2025 keynote: 40x AI performance leap, open-source ‘Dynamo’, and a walking Star Wars-inspired ‘Blue’ robot
  • BigDATAwire: Nvidia used its GTC conference today to introduce new GPU superchips, including the second generation of its current Grace Blackwell chip, as well as the next generation, dubbed the Vera The post appeared first on .
  • IBM - Announcements: IBM Taps NVIDIA AI Data Platform Technologies to Accelerate AI at Scale
  • Analytics Vidhya: Nvidia CEO Jensen Huang's keynote at GTC 2025 is detailed, emphasizing the advancements in AI and accelerated computing, including the introduction of the Blackwell Ultra GB300 chips and the upcoming Blackwell B300 series.
  • AIwire: Nvidia CEO Jensen Huang's keynote highlighted advancements in AI and predictions for industry evolution in the coming years. The keynote showcased Nvidia's next-generation graphics architectures: Blackwell Ultra and Vera Rubin, designed for managing sophisticated AI processes.
  • Analytics Vidhya: Nvidia’s GTC 2025 Announcements That Shook the Stock Market
  • AIwire: Nvidia touts next-generation GPU superchip and new photonic switches, unveiling Blackwell Ultra and Vera Rubin architectures.
  • Gradient Flow: Nvidia’s AI Vision: GTC 2025 and the Road Ahead
  • OODAloop: Nvidia is releasing what it’s calling an AI foundation model for humanoid robotics.
  • Data Phoenix: Nvidia introduces the Blackwell Ultra to support the rise of AI reasoning, agents, and physical AI
  • insideAI News: In what is becoming an annual tradition for the @HPCpodcast, we present “Live from Nvidia GTC 2025,â€� covering highlights from the Nvidia extravaganza with an AI-everywhere theme.
  • BigDATAwire: Reports from Nvidia’s GPU Technology Conference (GTC) 2025, a weeklong event in San Jose, California that be remembered for a long time, if not for the content

ChinaTechNews.com Staff@ChinaTechNews.com //
Nvidia Corp. has signaled a strong trajectory for AI-driven growth into 2025, bolstered by a solid fourth-quarter earnings and revenue beat. The company's revenue jumped 78% year-over-year, surpassing investor expectations, with earnings reaching $0.89 per share, exceeding estimates. Nvidia's guidance for the current quarter indicates continued growth, forecasting sales of $43 billion, which further demonstrates the company's confidence in sustained demand for its AI-related products.

Nvidia's success is attributed to the high demand for its GPUs, particularly for AI applications. The company has begun producing its next-generation Blackwell GPUs, with CEO Jensen Huang noting strong demand. Data Center revenue saw a remarkable increase of 93% year-over-year, reaching $35.6 billion. This performance underscores Nvidia's leadership in providing hardware for AI advancements and its pivotal role in the ongoing AI revolution.

Recommended read:
References :
  • NextBigFuture.com: Nvidia once again beat the quarterly earnings estimate and increased guidance more than expectations. Revenue: $39.3B vs. $38.1B est (+78% YoY) • EPS: $0.89 vs. $0.85 est • Data Center: $35.6B vs $33.5B est (+93% YoY)
  • www.theguardian.com: Nvidia beats Wall Street expectations in first earnings after DeepSeek’s AI debut - Investors were eyeing the firm for signs of slowing demand after revelation high-end chips not necessary, but found few surpassed investor expectations for the fourth quarter of 2024 with a 78% jump in revenue year over year.
  • Dataconomy: Quarterly earnings from Nvidia (NVDA.O) on Wednesday stand as a significant event for markets amid investor scrutiny regarding substantial spending in artificial intelligence (AI).
  • SiliconANGLE: Chipmaker Nvidia Corp. today signaled that it’s on course for yet more artificial intelligence-driven growth in 2025 after delivering a solid fourth-quarter earnings and revenue beat and offering strong guidance for the current quarter.
  • bsky.app: Nvidia’s Q4 revenue soared 78% YoY to $39.3B versus $38.05B expected, driven by strong demand for Blackwell AI chips.
  • ChinaTechNews.com: Nvidia posts $39B quarter: Has the AI chip giant defied market jitters over DeepSeek?
  • The Register - Software: Cash torrent pouring into Nvidia slows – despite booming Blackwell adoption
  • SiliconANGLE: Nvidia’s fine! Besides, who else is going to power all these new AI models?
  • insideAI News: Feb. 28, 2025 — SoftBank Corp., ZutaCore and Hon Hai Technology Group (Foxconn) today announced that they implemented ZutaCore’s two-phase direct liquid cooling technology*1 in an AI server using NVIDIA accelerated computing. The companies said this is the first implementation*2 of ZutaCore’s two-phase DLC*1 using NVIDIA H200 GPUs. In addition, SoftBank designed and developed a rack-integrated […]
  • THE DECODER: Chinese dealers advertise Nvidia's Blackwell processors despite strict US export controls

Ellie Ramirez-Camara@Data Phoenix //
Nvidia's GTC 2025 event showcased the company's latest advancements in AI computing. A key highlight was the introduction of the Blackwell Ultra platform, designed to support the growing demands of AI reasoning, agentic AI, and physical AI applications. This next-generation platform builds upon the Blackwell architecture and includes the GB300 NVL72 rack-scale solution and the HGX B300 NVL16 system.

The Blackwell Ultra platform promises significantly enhanced AI computing power, with the GB300 NVL72 delivering 1.5x more AI performance than its predecessor and increasing revenue opportunities for AI factories by 50x. Major cloud providers and server manufacturers are expected to offer Blackwell Ultra-based products in the second half of 2025. Supporting this hardware is the new NVIDIA Dynamo open-source inference framework, which optimizes reasoning AI services across thousands of GPUs.

Recommended read:
References :
  • NVIDIA Newsroom: Innovation to Impact: How NVIDIA Research Fuels Transformative Work in AI, Graphics and Beyond
  • Data Phoenix: Nvidia introduces the Blackwell Ultra to support the rise of AI reasoning, agents, and physical AI
  • insideAI News: @HPCpodcast: Live from GTC 2025, Among the Crowds for the New AI Compute Landscape
  • Gradient Flow: Nvidia’s AI Vision: GTC 2025 and the Road Ahead
  • AIwire: Jensen Huang Charts Nvidia’s AI-Powered Future
  • BigDATAwire: Reporter’s Notebook: AI Hype and Glory at Nvidia GTC 2025
  • John Werner: Jensen Huang Hypes New Chips In Keynote

staff@insideAI News //
References: insideAI News , insidehpc.com ,
Fluidstack announced on March 25, 2025, its collaboration with Borealis Data Center, Dell Technologies, and NVIDIA to deploy and manage exascale GPU clusters across Iceland and Europe. Fluidstack aims to support AI labs, researchers, and enterprises by rapidly deploying high-density GPU supercomputers powered by 100% renewable energy. Borealis Data Center will provide facilities powered by renewable energy in Iceland and the Nordics, leveraging the region's cold climate and geothermal power. Dell PowerEdge XE9680 servers, optimized for AI workloads with NVIDIA HGX H200 and Quantum-2 InfiniBand networking, will be utilized to ensure performance and reliability.

Reports indicate that China's AI data center boom has lost momentum, leaving billions of dollars in idle infrastructure. Triggered by the rise of generative AI applications, China rapidly expanded its AI infrastructure in 2023-2024, constructing hundreds of new data centers with state and private funding. However, many facilities are now underused, returns are falling, and the market for GPU rentals has collapsed. Some data centers became outdated before they were fully operational due to changing market conditions and poor planning.

Recommended read:
References :
  • insideAI News: Fluidstack to Deploy Exascale GPU Clusters in Europe with NVIDIA, Borealis Data Center and Dell
  • insidehpc.com: Fluidstack to Deploy Exascale GPU Clusters in Europe with NVIDIA, Borealis Data Center and Dell
  • www.tomshardware.com: China's AI data center boom goes bust: Rush leaves billions of dollars in idle infrastructure

staff@insideAI News //
References: insideAI News , insidehpc.com ,
Fluidstack, an AI cloud platform, is collaborating with Borealis Data Center, Dell Technologies, and NVIDIA to deploy and manage exascale GPU clusters across Iceland and Europe. Announced on March 25, 2025, this initiative aims to support AI labs, researchers, and enterprises by providing the computational power needed for AI workloads. Borealis Data Center will provide Fluidstack with facilities powered by 100% renewable energy, leveraging Iceland's cold climate and renewable hydro and geothermal power.

Fluidstack will use Dell PowerEdge XE9680 servers, optimized for AI workloads with NVIDIA HGX H200, combined with NVIDIA's Quantum-2 InfiniBand networking. These Dell servers are designed for performance and reliability in AI workloads. According to Cesar Maklary, co-founder and president of Fluidstack, the goal is to "support the most exceptional AI labs, researchers and enterprises" through the rapid deployment of high-density GPU supercomputers for European and global customers, utilizing 100% renewable energy.

Recommended read:
References :
  • insideAI News: Fluidstack to Deploy Exascale GPU Clusters in Europe with NVIDIA, Borealis Data Center and Dell
  • insidehpc.com: Fluidstack to Deploy Exascale GPU Clusters in Europe with NVIDIA, Borealis Data Center and Dell
  • lambda.ai: Be First, Scale Fast - NVIDIA Blackwell GPU Clusters Now Live on Lambda

Marco Chiappetta@hothardware.com //
NVIDIA continues to assert its dominance in both AI infrastructure and mobile gaming. The company is blitzing the laptop gaming market with its flagship GeForce RTX 5090 Laptop GPU, hailed as the fastest mobile GPU ever tested. This new GPU, based on the Blackwell architecture, promises to deliver enhanced performance and features for gamers and content creators on the go. The RTX 5090 Laptop GPU incorporates updated shader cores, 4th gen RT cores, and 5th gen Tensor cores, supporting DLSS 4 and a new media engine.

Nvidia is taking major steps to promote its open source enterprise AI infrastructure. NVIDIA has announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license. Originally developed within the Run:ai platform, KAI Scheduler is now available to the community while also continuing to be packaged and delivered as part of the NVIDIA Run:ai platform. This initiative underscores NVIDIA’s commitment to advancing both open-source and enterprise AI infrastructure, fostering an active and collaborative community, encouraging contributions, feedback, and innovation.

Recommended read:
References :
  • hothardware.com: GeForce RTX 5090 Laptop GPU Review: NVIDIA’s Best Mobile Gaming Chip Yet
  • NVIDIA Technical Blog: NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration
  • AIwire: Nvidia’s Strategic Acquisitions Signal Push Toward Full-Stack AI Control