News from the AI & ML world

DeeperML

Asif Razzaq@MarkTechPost //
DeepSeek AI is accelerating the release of its R2 AI reasoning model, a sequel to its R1 model that was launched in January. The R1 model matched or exceeded the performance of models from major Western companies like OpenAI, Meta, and Google. The release of R1 precipitated a significant stock sell-off, and the R2 model is expected to have enhanced coding and reasoning capabilities in multiple languages.

DeepSeek is moving up the release date for R2, which was initially planned for early May. This accelerated release may further intensify concerns in the United States regarding global AI leadership and is expected to encourage many Chinese companies to integrate DeepSeek models into their products. Furthermore, DeepSeek has announced the release of DeepGEMM, a library designed for efficient FP8 General Matrix Multiplications (GEMMs), as part of #OpenSourceWeek. This new library will help improve the efficiency of training AI models.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • Techstrong.ai: DeepSeek is working on the sequel to its R1 blockbuster.
  • Analytics Vidhya: As part of the ongoing #OpenSourceWeek, DeepSeek announced the release of DeepGEMM, a cutting-edge library designed for efficient FP8 General Matrix Multiplications (GEMMs).
  • MarkTechPost: DeepSeek AI Releases DualPipe: A Bidirectional Pipeline Parallelism Algorithm for Computation-Communication Overlap in V3/R1 Training
  • MarkTechPost: DeepSeek AI Releases Smallpond: A Lightweight Data Processing Framework Built on DuckDB and 3FS
  • Fello AI: DeepSeek is rapidly emerging as a significant player in the AI space, particularly since its public release in January 2025. This Chinese AI startup, founded in 2023, has quickly gained traction, challenging established models like ChatGPT and Claude.
Classification: