News from the AI & ML world

DeeperML - #china

Jaime Hampton@AIwire //
References: AI News , Sify , AIwire ...
DeepSeek's innovative AI models are reshaping China's AI data center infrastructure, disrupting the market and potentially leaving resources underutilized. The company's DeepSeek-V3 model has demonstrated performance that rivals ChatGPT at a significantly lower cost. This has altered demand for the extensive GPU clusters used in traditional AI training and shifted the focus toward hardware that prioritizes low latency, particularly near tech hubs. The shift has fueled speculation and left even experienced players facing the challenge posed by DeepSeek-V3.

The open-source nature of DeepSeek’s model is also allowing smaller players to compete without the need for extensive pretraining, which is undermining the demand for large data centers. DeepSeek-V3, which runs at 20 tokens per second on a Mac Studio, poses a new challenge for existing AI models. Chinese AI startups are now riding DeepSeek's momentum and building an ecosystem that is revolutionizing the AI landscape. This narrows the technology divide between China and the United States.

References :
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide
  • Sify: DeepSeek’s AI Revolution: Creating an Entire AI Ecosystem
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet
  • AIwire: Report: China’s Race to Build AI Datacenters Has Hit a Wall
  • Quinta’s weblog: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI

Dashveenjit Kaur@AI News //
References: venturebeat.com , AI News , Nordic APIs ...
Chinese AI startup DeepSeek is shaking up the global technology landscape with its latest large language model, DeepSeek-V3-0324. This new model has been lauded for matching the performance of American AI models, while boasting significantly lower development costs. According to Lee Kai-fu, CEO of Chinese startup 01.AI, the gap between Chinese and American AI capabilities has narrowed dramatically, with China even ahead in some specific areas.

DeepSeek-V3-0324 features enhanced reasoning capabilities and improved performance in multiple benchmarks, particularly in mathematics. The model scored 59.4 on the American Invitational Mathematics Examination (AIME), a significant improvement over its predecessor. Häme University lecturer Petri Kuittinen noted that DeepSeek's achievements were realized with just a fraction of the resources available to competitors like OpenAI. This breakthrough has been attributed to DeepSeek’s focus on algorithmic efficiency and novel approaches to model architecture, which allowed the company to overcome restrictions on access to the latest silicon.

This disruption has not gone unnoticed: when DeepSeek launched its R1 model in January, America’s Nasdaq plunged 3.1%, while the S&P 500 fell 1.5%. Although DeepSeek claimed a $5.6 million training cost, this represented only the marginal cost of the final training run. SemiAnalysis estimates DeepSeek's actual hardware investment at closer to $1.6 billion, with hundreds of millions in operating costs. The developments present opportunities and challenges.

References :
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide
  • Sify: DeepSeek’s AI Revolution: Creating an Entire AI Ecosystem
  • Nordic APIs: ChatGPT vs. DeepSeek: A Side-by-Side Comparison
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet

Dashveenjit Kaur@AI News //
References: AI News , MarkTechPost , AI News ...
DeepSeek, a Chinese AI startup, is causing a stir in the AI industry with its new large language model, DeepSeek-V3-0324. Released with little fanfare on the Hugging Face AI repository, the 641-gigabyte model is freely available for commercial use under an MIT license. Early reports indicate it can run directly on consumer-grade hardware, such as Apple’s Mac Studio with the M3 Ultra chip, especially in a 4-bit quantized version that reduces the storage footprint to 352GB. This innovation challenges the previous notion that Silicon Valley held a chokehold on the AI industry.
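
As a rough check on the storage figures above, the sketch below estimates weight storage from a parameter count and a bits-per-parameter value. It assumes the 685-billion-parameter count reported for this model elsewhere on this page, counts 1 GB as 10^9 bytes, and ignores quantization metadata and any layers kept at higher precision, which may account for part of the gap to the reported 352GB. The function name is illustrative only.

```python
def weight_footprint_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumption: parameter count as reported for DeepSeek-V3-0324 on this page.
PARAMS = 685e9

four_bit_gb = weight_footprint_gb(PARAMS, 4)
eight_bit_gb = weight_footprint_gb(PARAMS, 8)

print(f"4-bit weights: about {four_bit_gb:.1f} GB")
print(f"8-bit weights: about {eight_bit_gb:.1f} GB")

# A 512GB Mac Studio can hold the 4-bit estimate in memory, but not the 8-bit one.
print("4-bit fits in 512 GB:", four_bit_gb < 512)
print("8-bit fits in 512 GB:", eight_bit_gb < 512)
```

The estimate is a lower bound on the real file size, so it is useful mainly for judging whether a given quantization level is in the right range for a machine's memory.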

China's focus on algorithmic efficiency over hardware superiority has allowed companies like DeepSeek to flourish despite restrictions on access to the latest silicon. DeepSeek's R1 model, launched earlier this year, already rivaled OpenAI's ChatGPT-4 at a fraction of the cost. Now DeepSeek-V3-0324 adds enhanced reasoning capabilities and improved performance. This has sparked a gold rush among Chinese tech startups, rewriting the playbook for AI development and leading smaller companies to believe they have a shot in the market.

References :
  • AI News: DeepSeek V3-0324 has become the highest-scoring non-reasoning model on the Artificial Analysis Intelligence Index in a landmark achievement for open-source AI.
  • MarkTechPost: Artificial intelligence (AI) has made significant strides in recent years, yet challenges persist in achieving efficient, cost-effective, and high-performance models.
  • Quinta’s weblog: Chinese AI startup DeepSeek has quietly released a new large language model that’s already sending ripples through the artificial intelligence industry, not just for its capabilities, but for how it’s being deployed.
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide
  • Composio: Deepseek v3 0324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet and
  • SiliconANGLE: DeepSeek today released an improved version of its DeepSeek-V3 large language model under a new open-source license.
  • Sify: DeepSeek’s AI Revolution: Creating an Entire AI Ecosystem
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet

Ryan Daws@AI News //
References: SiliconANGLE , venturebeat.com , AI News ...
DeepSeek, a Chinese AI company, has released DeepSeek V3-0324, an updated AI model that demonstrates impressive performance. The model runs at 20 tokens per second on a Mac Studio. It is said to contain 685 billion parameters, and its cost-effectiveness challenges the dominance of American AI models, signaling that China continues to innovate in AI despite chip restrictions. Early testers report improvements over previous versions, and the model is the first open-source model to top the rankings of non-reasoning AI models.

This new model runs on consumer-grade hardware, specifically Apple's Mac Studio with the M3 Ultra chip, diverging from the typical data center requirements for AI. It is freely available for commercial use under the MIT license. According to AI researcher Awni Hannun, the model runs at over 20 tokens per second on a 512GB M3 Ultra. The company has made no formal announcement, just an empty README file and the model weights themselves. This stands in contrast to the carefully orchestrated product launches by Western AI companies.

References :
  • SiliconANGLE: DeepSeek today released an improved version of its DeepSeek-V3 large language model under a new open-source license.
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: Chinese AI innovation is reshaping the global technology landscape, challenging assumptions about Western dominance in advanced computing. Recent developments from companies like DeepSeek illustrate how quickly China has adapted to and overcome international restrictions through creative approaches to AI development.
  • AI News: DeepSeek V3-0324 tops non-reasoning AI models in open-source first
  • MarkTechPost: DeepSeek AI Unveils DeepSeek-V3-0324: Blazing Fast Performance on Mac Studio, Heating Up the Competition with OpenAI
  • Cloud Security Alliance: DeepSeek: Behind the Hype and Headlines
  • Quinta’s weblog: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet

Ryan Daws@AI News //
References: venturebeat.com , AI News
DeepSeek, a Chinese AI startup, is making waves in the artificial intelligence industry with its DeepSeek-V3 model. This model is demonstrating performance that rivals Western AI models like those from OpenAI and Anthropic, but at significantly lower development costs. The release of DeepSeek-V3 is seen as jumpstarting AI development across China, with other startups and established companies releasing their own advanced models, further fueling competition. This has narrowed the technology gap between China and the United States as China has adapted to and overcome international restrictions through creative approaches to AI development.

One particularly notable aspect of DeepSeek-V3 is its ability to run efficiently on consumer-grade hardware, such as the Mac Studio with an M3 Ultra chip. Reports indicate that the model achieves speeds of over 20 tokens per second on this platform, making it a potential "nightmare for OpenAI". This contrasts sharply with the data center requirements typically associated with state-of-the-art AI models. The company has achieved notable gains despite restricted access to the latest silicon, showing that Chinese AI innovation has flourished through a focus on algorithmic efficiency and novel approaches to model architecture.

References :
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide
  • GZERO Media: How DeepSeek changed China’s AI ambitions

@tomshardware.com //
Ant Group has announced a significant breakthrough in AI, achieving a 20% reduction in AI costs by training models on domestically produced Chinese chips. According to reports, the company utilized chips from Chinese tech giants Alibaba and Huawei, reaching performance levels comparable to those obtained with Nvidia's H800 chips. The AI models, named Ling-Plus and Ling-Lite, are said to match or even outperform leading models, with Ant Group claiming its AI models outperformed Meta’s in benchmarks and cut inference costs.

This accomplishment signals a potential leap forward in China's AI development efforts and a move towards self-reliance in semiconductor technology. While Ant Group still uses Nvidia hardware for some tasks, it is now relying more on alternatives, including chips from AMD and Chinese manufacturers, driven in part by U.S. sanctions that limit access to Nvidia's advanced GPUs. This shift could lessen the country’s dependence on foreign technology.

References :
  • Jon Keegan: This post mentions Ant Group's breakthrough with new, fast, cheap AI models trained on Chinese chips, highlighting innovation around US export controls.
  • www.tomshardware.com: This news article reports on Ant Group's use of Chinese semiconductors to improve AI development efficiency and reduce costs.
  • www.techrepublic.com: Ant Group Slashes AI Costs by 20% With Chinese-Made Chips: What It Means for U.S. Tech

Matthias Bastian@THE DECODER //
Baidu has launched two new AI models, ERNIE 4.5 and ERNIE X1, designed to compete with DeepSeek's R1 model. The company is making these models freely accessible to individual users through the ERNIE Bot platform, ahead of the initially planned schedule. ERNIE 4.5 is a multimodal foundation model, integrating text, images, audio, and video to enhance understanding and content generation across various data types. This model demonstrates significant improvements in language understanding, reasoning, and coding abilities.

ERNIE X1 is Baidu's first model specifically designed for complex reasoning tasks, excelling in logical inference, problem-solving, and structured decision-making suitable for applications in finance, law, and data analysis. Baidu claims that ERNIE X1 matches DeepSeek R1’s performance at half the cost. ERNIE 4.5 has shown performance on par with models like DeepSeek-R1, but at approximately half the deployment cost.

References :
  • AiThority: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge.
  • techxplore.com: Chinese internet search giant Baidu released a new artificial intelligence reasoning model Sunday and made its AI chatbot services free to consumers as ferocious competition grips the sector.
  • THE DECODER: Baidu claims its Ernie X1 reasoning model matches Deepseek-R1 performance at half the price
  • Analytics Vidhya: China has done it again with its AI models and this time the blow is bigger and better! Baidu – a Chinese AI company, recently released two large language models (LLMs) – ERNIE 4.5 & X1.
  • TestingCatalog: Discover Baidu's new AI models, ERNIE 4.5 and ERNIE X1, now freely accessible via ERNIE Bot. Experience cutting-edge AI tech ahead of schedule!
  • Analytics India Magazine: China’s Baidu Launches Two New AI Models, Rivals DeepSeek R1 at Half the Price
  • TechCrunch: Chinese search engine Baidu has launched two new AI models — Ernie 4.5, the latest version of the company’s foundational model first released two years ago, as well as a new reasoning model, Ernie X1. According to Reuters, Baidu claims that Ernie X1’s performance is “on par with DeepSeek R1 at only
  • AIwire: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge.
  • AI News: Baidu undercuts rival AI models with ERNIE 4.5 and ERNIE X1
  • AI News | VentureBeat: Baidu has also announced plans to integrate ERNIE 4.5 and ERNIE X1 into its broader ecosystem, including Baidu Search and the Wenxiaoyan app.
  • www.tomshardware.com: ERNIE 4.5 AI model by Baidu claims to match DeepSeek R1 at half the cost
  • Fello AI: Baidu’s New ERNIE 4.5 & X1 – A Free AI That Is Better Than GPT-4.5 & Costs Pennies!

Matthias Bastian@THE DECODER //
References: TestingCatalog , THE DECODER , AiThority ...
Baidu has released two new large language models, ERNIE 4.5 and ERNIE X1, claiming they outperform OpenAI's GPT-4.5 and DeepSeek-R1. These models are more cost-effective, offering high quality at a fraction of the price. ERNIE 4.5 is a multimodal foundation model that integrates text, images, audio, and video, enhancing its ability to understand and generate different kinds of content. ERNIE X1 is a deep-thinking reasoning model with multimodal capabilities, excelling in tasks requiring advanced reasoning.

Baidu has made both models freely accessible to individual users via the ERNIE Bot platform, ahead of schedule. For enterprise users and developers, ERNIE 4.5 is available via APIs on Baidu AI Cloud's Qianfan platform, with ERNIE X1 set to follow. Baidu also plans to integrate the models into its existing products, including Baidu Search and the Wenxiaoyan app. This move positions Baidu as a competitive force in the AI landscape, challenging Western AI companies.

References :
  • TestingCatalog: This article discusses Baidu's ERNIE 4.5 and ERNIE X1 models, highlighting their performance and lower prices compared to DeepSeek.
  • THE DECODER: This article discusses Baidu’s new LLMs, ERNIE 4.5 and ERNIE X1, highlighting their competitive pricing and plans for open-source release in the context of the AI market.
  • Analytics Vidhya: This article discusses Baidu’s release of ERNIE 4.5 and ERNIE X1 LLMs, highlighting their claimed performance advantages over GPT-4.5 and cost-effectiveness.
  • AiThority: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge. As a deep-thinking reasoning model with multimodal capabilities, ERNIE X1 delivers performance on par with DeepSeek R1 at only half the price. ERNIE 4.5 is the [...]
  • techxplore.com: Chinese internet search giant Baidu released a new artificial intelligence reasoning model Sunday and made its AI chatbot services free to consumers as ferocious competition grips the sector.
  • TechCrunch: Chinese search engine Baidu has launched two new AI models — Ernie 4.5, the latest version of the company’s foundational model first released two years ago, as well as a new reasoning model, Ernie X1.
  • AI News: Baidu undercuts rival AI models with ERNIE 4.5 and ERNIE X1
  • techstrong.ai: Baidu Unleashes Speedy New AI Model to Rival DeepSeek
  • AI News | VentureBeat: Baidu has also announced plans to integrate ERNIE 4.5 and ERNIE X1 into its broader ecosystem, including Baidu Search and the Wenxiaoyan app.
  • Fello AI: Baidu’s New ERNIE 4.5 & X1 – A Free AI That Is Better Than GPT-4.5 & Costs Pennies!

Ryan Daws@AI News //
References: AI News , Unite.AI
Leading US artificial intelligence companies, including OpenAI, Anthropic, and Google, are urging the US government to take decisive action to secure the nation's AI leadership. The companies have submitted documents to the government warning that America's lead in AI is diminishing due to the increasing capabilities of Chinese models like DeepSeek R1. This call for action comes in response to a request for information on developing an AI Action Plan.

These submissions highlight concerns about national security, economic competitiveness, and the necessity for strategic regulatory frameworks. OpenAI warns that DeepSeek shows that the US lead is not wide and is narrowing, characterizing the model as state-subsidized, state-controlled, and freely available. Anthropic's filing focuses on biosecurity concerns, particularly DeepSeek-R1's willingness to provide information about biological weapons, which it presents as evidence of the need for better government oversight of AI systems.

References :
  • AI News: OpenAI and Google call for US government action to secure AI lead
  • Unite.AI: OpenAI, Anthropic, and Google Urge Action as US AI Lead Diminishes

Nitika Sharma@Analytics Vidhya //
China's Manus AI, developed by Monica, is generating buzz as an invite-only multi-agent AI product. This AI agent is designed to autonomously tackle complex, real-world tasks by operating as a multi-agent system. It utilizes a planner optimized for strategic reasoning, and an executor driven by Claude 3.5 Sonnet, incorporating code execution, web browsing, and multi-file code management.

The AI agent has sparked considerable global attention, igniting discussions about its technological and ethical implications, as well as its potential impact on the AI landscape. Manus reportedly outperformed OpenAI's o3-powered Deep Research agent on benchmarks, as showcased on the Manus website, leading some to believe it is among the most effective autonomous agents currently available. However, there is some skepticism because it appears to be a Claude wrapper with a jailbreak and tools optimized for the GAIA benchmark.

References :
  • Maginative: Manus AI, China's new autonomous agent, is making waves with its ability to independently analyze, plan, and execute tasks. With industry leaders calling it “the AI agent we were promised,” it's raising the stakes in the global AI race.
  • MarkTechPost: In today’s digital era, the way we work is rapidly evolving, yet many challenges persist. Conventional AI assistants and manual workflows struggle to keep pace with the complexity and volume of modern tasks. Professionals and businesses face repetitive manual processes, inefficient research methods, and a lack of true automation. While traditional tools offer suggestions and […]
  • Fello AI: Manus AI is a newly announced autonomous AI agent developed by the Chinese startup Monica. It has been designed as a general AI agent that goes beyond simple text generation by autonomously planning, executing, and delivering complex tasks. The system is positioned as a breakthrough in AI technology, offering capabilities that mimic a human team working […]
  • Analytics Vidhya: Ever felt buried under a mountain of tasks, wishing for an extra set of hands to get things done? What if you could offload those tasks and get results without being glued to your screen? Manus – an AI agent from China gaining attention for its ability to handle general tasks with ease. In a […]
  • The Rundown AI: PLUS: China's Manus demos ‘world’s first fully autonomous’ AI agent
  • Craig Smith: Forbes discusses China’s Autonomous Agent, Manus, Changes Everything
  • AI News | VentureBeat: What you need to know about Manus, the new AI agentic system from China
  • AI Accelerator Institute: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
  • thezvi.wordpress.com: The Manus Marketing Madness
  • Analytics Vidhya: This article compares China's new AI agent 'Manus' with OpenAI's 'Operator'.
  • The Register - Software: Prompts see it scour the web for info and turn it into decent documents at reasonable speed. Chinese researchers’ AI prowess is again a hot topic after a startup called Monica.im last week revealed “Manus”, a service it bills as a “general agent” that might improve on tools offered by Western companies.
  • AIwire: China’s Manus AI: A Game-Changer or Just Another Overhyped Agent?
  • bdtechtalks.com: What is Manus, the AI agent taking on OpenAI Deep Research
  • OODAloop: China’s new AI agent, Manus, operates autonomously, sparking debate on its impact, ethics, and global AI competition. Here’s what you need to know.
  • pub.towardsai.net: Discussion on Manus AI's architecture, performance, and potential.
  • Tech News | Euronews RSS: A new Chinese AI platform is causing a frenzy. But is it worth the hype? Euronews Next takes a look.
  • techxplore.com: What to know about Manus, China's latest AI assistant
  • www.laptopmag.com: What is Manus AI? The autonomous assistant that wants to do the work for you
  • techstrong.ai: Chinese Startup’s Manus AI Agent Generates Hype, Skepticism
  • www.tomsguide.com: Manus AI is the new challenger to DeepSeek — everything you need to know
  • Gradient Flow: Manus: What You Need To Know
  • hackernoon.com: Founder of China’s New AI Model Says His Agent is More Autonomous Than Rivals'
  • iHLS: Introducing Manus: The World’s First Fully Autonomous AI Agent
  • TechNode: China’s AI agent Manus gains traction amid growing demand for autonomous AI