News from the AI & ML world

DeeperML - #gpus

Doug Black@insideAI News //
NVIDIA is actively developing AI chips specifically designed to comply with U.S. export regulations for the Chinese market. This strategic move aims to allow NVIDIA to maintain its presence in China's significant AI market despite increasing restrictions. Concurrently, NVIDIA's CEO Jensen Huang has voiced support for Trump's tariff plan, describing it as "utterly visionary," signaling a complex navigation of both technological and political landscapes. These developments highlight NVIDIA's determination to balance its business interests with evolving geopolitical dynamics.

The new chips, reportedly based on the RTX Pro 6000-series, will have significantly reduced specifications to meet export control requirements. This includes forgoing advanced technologies like Taiwan Semiconductor’s CoWoS packaging and using standard GDDR7 memory instead of high-bandwidth memory. While the specifics of the chips, potentially named RTX Pro 6000D, are still emerging, these adjustments are essential for NVIDIA to continue offering competitive AI solutions in China, where a substantial number of AI developers are located.

Challenges persist, as the company previously absorbed a $4.5 billion hit due to export restrictions, leading to a write-down on Chinese inventory and commitments. The emergence of strong domestic competitors, particularly Huawei, intensifies the pressure on NVIDIA. Huawei's Ascend 910C and 910B processors have gained traction among major Chinese tech firms, and their CloudMatrix 384 rack system directly rivals NVIDIA's Blackwell GB200 NVL72 configuration. Despite these obstacles, NVIDIA remains committed to the Chinese market, viewing it as crucial for maintaining its global leadership in AI technology.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • insideAI News: Report: NVIDIA and AMD Devising Export Rules-Compliant Chips for China AI Market
  • The Register - Software: Here’s what it’ll take for Nvidia and other US chipmakers to flog AI chips in China
  • insidehpc.com: Report: NVIDIA and AMD Devising Export Rules-Compliant Chips for China AI Market
  • www.tomshardware.com: Nvidia CEO says Trump's tariff plan is 'utterly visionary'
  • PCMag Middle East ai: Nvidia Criticizes US's China Chip Ban, Stops Short of Blaming Trump Directly
Classification:
Ben Lorica@Gradient Flow //
Nvidia's Dynamo is a new open-source framework designed to tackle the complexities of scaling AI inference operations. Dynamo optimizes how large language models operate across multiple GPUs, balancing individual performance with system-wide throughput. Introduced at the GPU Technology Conference, Nvidia CEO Jensen Huang has described it as "the operating system of an AI factory".

This framework includes components designed to function as an "air traffic control system" for AI processing. These key components include libraries like TensorRT-LLM and SGLang, which provide efficient mechanisms for handling token generation, memory management, and batch processing to improve throughput and reduce latency when serving AI models. Nvidia's nGPT combines transformers and state-space models to reduce costs and increase speed while maintaining accuracy.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • Gradient Flow: Diving into Nvidia Dynamo: AI Inference at Scale
  • bdtechtalks.com: Nvidia’s Hymba is an efficient SLM that combines state-space models and transformers
  • MarkTechPost: NVIDIA AI Researchers Introduce FFN Fusion: A Novel Optimization Technique that Demonstrates How Sequential Computation in Large Language Models LLMs can be Effectively Parallelized
Classification:
  • HashTags: #NvidiaDynamo #AIInference #GPUs
  • Company: Nvidia
  • Target: AI community
  • Product: Dynamo
  • Feature: AI Inference
  • Type: ProductUpdate
  • Severity: Informative
staff@insidehpc.com //
Nvidia's GTC 2025 event showcased the company's advancements in AI, particularly highlighting the integration of AI into various industries. CEO Jensen Huang emphasized that every industry is adopting AI and it is becoming critical for future revenue. Nvidia also unveiled an open Physical AI dataset to advance robotics and autonomous vehicle development. The dataset is claimed to be the world’s largest unified and open dataset for physical AI development, enabling the pretraining and post-training of AI models.

Central to Nvidia’s ambitions for Physical AI is its Omniverse platform, a digital development platform connecting spatial computing, 3D design, and physics-based workflows. Originally designed as a simulation and visualization tool, Omniverse has evolved significantly and has now become more of an operating system for Physical AI, allowing users to train autonomous systems before physical deployment. In quantum computing, SEEQC and Nvidia announced they have completed an end-to-end fully digital quantum-classical interface protocol demo between a QPU and GPU.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • BigDATAwire: The Rise of Intelligent Machines: Nvidia Accelerates Physical AI Progress
Classification: