News from the AI & ML world

DeeperML - #llm

Maximilian Schreiner@THE DECODER //
Google's Gemini 2.5 Pro is making waves as a top-tier reasoning model, marking a leap forward in Google's AI capabilities. Released recently, it's already garnering attention from enterprise technical decision-makers, especially those who have traditionally relied on OpenAI or Claude for production-grade reasoning. Early experiments, benchmark data, and developer reactions suggest Gemini 2.5 Pro is worth serious consideration.

Gemini 2.5 Pro distinguishes itself with its transparent, structured reasoning. Google's step-by-step training approach results in a structured chain of thought that provides clarity. The model presents ideas in numbered steps, with sub-bullets and internal logic that's remarkably coherent and transparent. This breakthrough offers greater trust and steerability, enabling enterprise users to validate, correct, or redirect the model with more confidence when evaluating output for critical tasks.

Recommended read:
References :
  • SiliconANGLE: Google LLC said today it’s updating its flagship Gemini artificial intelligence model family by introducing an experimental Gemini 2.5 Pro version.
  • The Tech Basic: Google's New AI Models “Think” Before Answering, Outperform Rivals
  • AI News | VentureBeat: Google releases ‘most intelligent model to date,’ Gemini 2.5 Pro
  • Analytics Vidhya: We Tried the Google 2.5 Pro Experimental Model and It’s Mind-Blowing!
  • www.tomsguide.com: Google unveils Gemini 2.5 — claims AI breakthrough with enhanced reasoning and multimodal power
  • Google DeepMind Blog: Gemini 2.5: Our most intelligent AI model
  • THE DECODER: Google Deepmind has introduced Gemini 2.5 Pro, which the company describes as its most capable AI model to date. The article appeared first on .
  • intelligence-artificielle.developpez.com: Google DeepMind a lancé Gemini 2.5 Pro, un modèle d'IA qui raisonne avant de répondre, affirmant qu'il est le meilleur sur plusieurs critères de référence en matière de raisonnement et de codage
  • The Tech Portal: Google unveils Gemini 2.5, its most intelligent AI model yet with ‘built-in thinking’
  • Ars OpenForum: Google says the new Gemini 2.5 Pro model is its “smartest†AI yet
  • The Official Google Blog: Gemini 2.5: Our most intelligent AI model
  • www.techradar.com: I pitted Gemini 2.5 Pro against ChatGPT o3-mini to find out which AI reasoning model is best
  • bsky.app: Google's AI comeback is official. Gemini 2.5 Pro Experimental leads in benchmarks for coding, math, science, writing, instruction following, and more, ahead of OpenAI's o3-mini, OpenAI's GPT-4.5, Anthropic's Claude 3.7, xAI's Grok 3, and DeepSeek's R1. The narrative has finally shifted.
  • Shelly Palmer: Google’s Gemini 2.5: AI That Thinks Before It Speaks
  • bdtechtalks.com: What to know about Google Gemini 2.5 Pro
  • Interconnects: The end of a busy spring of model improvements and what's next for the presumed leader in AI abilities.
  • www.techradar.com: Gemini 2.5 is now available for Advanced users and it seriously improves Google’s AI reasoning
  • www.zdnet.com: Google releases 'most intelligent' experimental Gemini 2.5 Pro - here's how to try it
  • Unite.AI: Gemini 2.5 Pro is Here—And it Changes the AI Game (Again)
  • TestingCatalog: Gemini 2.5 Pro sets new AI benchmark and launches on AI Studio and Gemini
  • Analytics Vidhya: Google DeepMind's latest AI model, Gemini 2.5 Pro, has reached the #1 position on the Arena leaderboard.
  • AI News: Gemini 2.5: Google cooks up its ‘most intelligent’ AI model to date
  • Fello AI: Google’s Gemini 2.5 Shocks the World: Crushing AI Benchmark Like No Other AI Model!
  • Analytics India Magazine: Google Unveils Gemini 2.5, Crushes OpenAI GPT-4.5, DeepSeek R1, & Claude 3.7 Sonnet
  • Practical Technology: Practical Tech covers the launch of Google's Gemini 2.5 Pro and its new AI benchmark achievements.
  • Shelly Palmer: Google's Gemini 2.5: AI That Thinks Before It Speaks
  • www.producthunt.com: Google's most intelligent AI model
  • Windows Copilot News: Google reveals AI ‘reasoning’ model that ‘explicitly shows its thoughts’
  • AI News | VentureBeat: Hands on with Gemini 2.5 Pro: why it might be the most useful reasoning model yet
  • thezvi.wordpress.com: Gemini 2.5 Pro Experimental is America’s next top large language model. That doesn’t mean it is the best model for everything. In particular, it’s still Gemini, so it still is a proud member of the Fun Police, in terms of …
  • www.computerworld.com: Gemini 2.5 can, among other things, analyze information, draw logical conclusions, take context into account, and make informed decisions.
  • www.infoworld.com: Google introduces Gemini 2.5 reasoning models
  • Maginative: Google's Gemini 2.5 Pro leads AI benchmarks with enhanced reasoning capabilities, positioning it ahead of competing models from OpenAI and others.
  • www.infoq.com: Google's Gemini 2.5 Pro is a powerful new AI model that's quickly becoming a favorite among developers and researchers. It's capable of advanced reasoning and excels in complex tasks.
  • AI News | VentureBeat: Google’s Gemini 2.5 Pro is the smartest model you’re not using – and 4 reasons it matters for enterprise AI
  • Communications of the ACM: Google has released Gemini 2.5 Pro, an updated AI model focused on enhanced reasoning, code generation, and multimodal processing.
  • The Next Web: Google has released Gemini 2.5 Pro, an updated AI model focused on enhanced reasoning, code generation, and multimodal processing.
  • www.tomsguide.com: Surprise move comes just days after Gemini 2.5 Pro Experimental arrived for Advanced subscribers.
  • Composio: Google just launched Gemini 2.5 Pro on March 26th, claiming to be the best in coding, reasoning and overall everything. But I The post appeared first on .
  • Composio: Google's Gemini 2.5 Pro, released on March 26th, is being hailed for its enhanced reasoning, coding, and multimodal capabilities.
  • Analytics India Magazine: Gemini 2.5 Pro is better than the Claude 3.7 Sonnet for coding in the Aider Polyglot leaderboard.
  • www.zdnet.com: Gemini's latest model outperforms OpenAI's o3 mini and Anthropic's Claude 3.7 Sonnet on the latest benchmarks. Here's how to try it.
  • www.marketingaiinstitute.com: [The AI Show Episode 142]: ChatGPT’s New Image Generator, Studio Ghibli Craze and Backlash, Gemini 2.5, OpenAI Academy, 4o Updates, Vibe Marketing & xAI Acquires X
  • www.tomsguide.com: Gemini 2.5 is free, but can it beat DeepSeek?
  • www.tomsguide.com: Google Gemini could soon help your kids with their homework — here’s what we know
  • PCWorld: Google’s latest Gemini 2.5 Pro AI model is now free for all users
  • www.techradar.com: Google just made Gemini 2.5 Pro Experimental free for everyone, and that's awesome.
  • Last Week in AI: #205 - Gemini 2.5, ChatGPT Image Gen, Thoughts of LLMs
  • Data Phoenix: Google Unveils Gemini 2.5: Its Most Intelligent AI Model Yet
  • SiliconANGLE: AWS brings its generative AI assistant to the Amazon OpenSearch Service

Ryan Daws@AI News //
DeepSeek V3-0324 has emerged as a leading AI model, topping benchmarks for non-reasoning AI in an open-source breakthrough. This milestone signifies a significant advancement in the field, as it marks the first time an open weights model has achieved the top position among non-reasoning models. The model's performance surpasses proprietary counterparts and edges it closer to proprietary reasoning models, highlighting the growing viability of open-source solutions for latency-sensitive applications. DeepSeek V3-0324 represents a new era for open-source AI, offering a powerful and adaptable tool for developers and enterprises.

DeepSeek-V3 now runs at 20 tokens per second on Apple’s Mac Studio, presenting a challenge to OpenAI’s cloud-dependent business model. The 685-billion-parameter model, DeepSeek-V3-0324, is freely available for commercial use under the MIT license. This achievement, coupled with its cost efficiency and performance, signals a shift in the AI sector, where open-source frameworks increasingly compete with closed systems. Early testers report significant improvements over previous versions, positioning DeepSeek's new model above Claude Sonnet 3.5 from Anthropic.

Recommended read:
References :
  • Analytics India Magazine: The model outperformed all other non-reasoning models across several benchmarks but trailed behind DeepSeek-R1, OpenAI’s o1, o3-mini, and other reasoning models.
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek V3-0324 tops non-reasoning AI models in open-source first
  • Analytics Vidhya: DeepSeek V3-0324: Generated 700 Lines of Code without Breaking
  • Analytics Vidhya: DeepSeek V3-0324 vs Claude 3.7: Which is the Better Coder?
  • Cloud Security Alliance: Markets reacted dramatically, with Nvidia alone losing nearly $600 billion in value in a single day, part of a broader...
  • GZERO Media: Just a few short months ago, Silicon Valley seemed to have the artificial intelligence industry in a chokehold.
  • MarkTechPost: DeepSeek AI Unveils DeepSeek-V3-0324: Blazing Fast Performance on Mac Studio, Heating Up the Competition with OpenAI
  • SiliconANGLE: DeepSeek today released an improved version of its DeepSeek-V3 large language model under a new open-source license.
  • techstrong.ai: DeepSeek Ups Ante (Again) in Duel with OpenAI, Anthropic
  • www.zdnet.com: DeepSeek V3 model gets a major upgrade
  • www.techradar.com: DeepSeek’s new AI is smarter, faster, cheaper, and a real rival to OpenAI's models
  • Composio: Deepseek v3 0324: Finally, the Sonnet 3.5 at Home
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide

Ryan Daws@AI News //
DeepSeek V3-0324, the latest large language model from Chinese AI startup DeepSeek, is making waves in the artificial intelligence industry. The model, quietly released with an MIT license for commercial use, has quickly become the highest-scoring non-reasoning model on the Artificial Analysis Intelligence Index. This marks a significant milestone for open-source AI, surpassing proprietary counterparts like Google’s Gemini 2.0 Pro, Anthropic’s Claude 3.7 Sonnet, and Meta’s Llama 3.3 70B.

DeepSeek V3-0324's efficiency is particularly notable. Early reports indicate that it can run directly on consumer-grade hardware, specifically Apple’s Mac Studio with an M3 Ultra chip, achieving speeds of over 20 tokens per second. This capability is a major departure from the typical data center requirements associated with state-of-the-art AI. The updated version demonstrates substantial improvements in reasoning and benchmark performance, as well as enhanced Chinese writing proficiency and optimized translation quality.

Recommended read:
References :
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek V3-0324 tops non-reasoning AI models in open-source first
  • Analytics Vidhya: DeepSeek V3-0324: Generated 700 Lines of Code without Breaking
  • Analytics India Magazine: The model outperformed all other non-reasoning models across several benchmarks but trailed behind DeepSeek-R1, OpenAI’s o1, o3-mini, and other reasoning models.
  • Cloud Security Alliance: DeepSeek: Behind the Hype and Headlines
  • techstrong.ai: DeepSeek Ups Ante (Again) in Duel with OpenAI, Anthropic
  • www.techradar.com: Deepseek’s new AI is smarter, faster, cheaper, and a real rival to OpenAI's models
  • Analytics Vidhya: DeepSeek V3-0324 vs Claude 3.7: Which is the Better Coder?
  • MarkTechPost: DeepSeek AI Unveils DeepSeek-V3-0324: Blazing Fast Performance on Mac Studio, Heating Up the Competition with OpenAI
  • www.zdnet.com: It's called V3-0324, but the real question is: Is it foreshadowing the upcoming launch of R2?
  • SiliconANGLE: DeepSeek today released an improved version of its DeepSeek-V3 large language model under a new open-source license.
  • Composio: Deepseek v3 o324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet and The post appeared first on .
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet

Matthias Bastian@THE DECODER //
Baidu has launched two new AI models, ERNIE 4.5 and ERNIE X1, designed to compete with DeepSeek's R1 model. The company is making these models freely accessible to individual users through the ERNIE Bot platform, ahead of the initially planned schedule. ERNIE 4.5 is a multimodal foundation model, integrating text, images, audio, and video to enhance understanding and content generation across various data types. This model demonstrates significant improvements in language understanding, reasoning, and coding abilities.

ERNIE X1 is Baidu's first model specifically designed for complex reasoning tasks, excelling in logical inference, problem-solving, and structured decision-making suitable for applications in finance, law, and data analysis. Baidu claims that ERNIE X1 matches DeepSeek R1’s performance at half the cost. ERNIE 4.5 has shown performance on par with models like DeepSeek-R1, but at approximately half the deployment cost.

Recommended read:
References :
  • AiThority: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge.
  • techxplore.com: Chinese internet search giant Baidu released a new artificial intelligence reasoning model Sunday and made its AI chatbot services free to consumers as ferocious competition grips the sector.
  • THE DECODER: Baidu claims its Ernie X1 reasoning model matches Deepseek-R1 performance at half the price
  • Analytics Vidhya: China has done it again with its AI models and this time the blow is bigger and better! Baidu – a Chinese AI company, recently released two large language models (LLMs) – ERNIE 4.5 & X1.
  • TestingCatalog: Discover Baidu's new AI models, ERNIE 4.5 and ERNIE X1, now freely accessible via ERNIE Bot. Experience cutting-edge AI tech ahead of schedule!
  • Analytics India Magazine: China’s Baidu Launches Two New AI Models, Rivals DeepSeek R1 at Half the Price
  • TechCrunch: Chinese search engine Baidu has launched two new AI models — Ernie 4.5, the latest version of the company’s foundational model first released two years ago, as well as a new reasoning model, Ernie X1. According to Reuters, Baidu claims that Ernie X1’s performance is “on par with DeepSeek R1 at only
  • AIwire: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge.
  • AI News: Baidu undercuts rival AI models with ERNIE 4.5 and ERNIE X1
  • AI News | VentureBeat: Baidu has also announced plans to integrate ERNIE 4.5 and ERNIE X1 into its broader ecosystem, including Baidu Search and the Wenxiaoyan app.
  • www.tomshardware.com: ERNIE 4.5 AI model by Baidu claims to match DeepSeek R1 at half the cost
  • Fello AI: Baidu’s New ERNIE 4.5 & X1 – A Free AI That Is Better Than GPT-4.5 & Costs Pennies!

Matthias Bastian@THE DECODER //
References: TestingCatalog , THE DECODER , AiThority ...
Baidu has released two new large language models, ERNIE 4.5 and ERNIE X1, claiming they outperform OpenAI's GPT-4.5 and DeepSeek-R1. These models are more cost-effective, offering high quality at a fraction of the price. ERNIE 4.5 is a multimodal foundation model that integrates text, images, audio, and video, enhancing its ability to understand and generate different kinds of content. ERNIE X1 is a deep-thinking reasoning model with multimodal capabilities, excelling in tasks requiring advanced reasoning.

Baidu has made both models freely accessible to individual users via the ERNIE Bot platform, ahead of schedule. For enterprise users and developers, ERNIE 4.5 is available via APIs on Baidu AI Cloud's Qianfan platform, with ERNIE X1 set to follow. Baidu also plans to integrate the models into its existing products, including Baidu Search and the Wenxiaoyan app. This move positions Baidu as a competitive force in the AI landscape, challenging Western AI companies.

Recommended read:
References :
  • TestingCatalog: This article discusses Baidu's ERNIE 4.5 and ERNIE X1 models, highlighting their performance and lower prices compared to DeepSeek.
  • THE DECODER: This article discusses Baidu’s new LLMs, ERNIE 4.5 and ERNIE X1, highlighting their competitive pricing and plans for open-source release in the context of the AI market.
  • Analytics Vidhya: This article discusses Baidu’s release of ERNIE 4.5 and ERNIE X1 LLMs, highlighting their claimed performance advantages over GPT-4.5 and cost-effectiveness.
  • AiThority: With the launch of ERNIE 4.5 and ERNIE X1, ERNIE Bot is made free to the public ahead of schedule, and users can access both models free of charge. As a deep-thinking reasoning model with multimodal capabilities, ERNIE X1 delivers performance on par with DeepSeek R1 at only half the price. ERNIE 4.5 is the [...]
  • techxplore.com: Chinese internet search giant Baidu released a new artificial intelligence reasoning model Sunday and made its AI chatbot services free to consumers as ferocious competition grips the sector.
  • TechCrunch: Chinese search engine Baidu has launched two new AI models — Ernie 4.5, the latest version of the company’s foundational model first released two years ago, as well as a new reasoning model, Ernie X1.
  • AI News: Baidu undercuts rival AI models with ERNIE 4.5 and ERNIE X1
  • techstrong.ai: Baidu Unleashes Speedy New AI Model to Rival DeepSeek
  • AI News | VentureBeat: Baidu has also announced plans to integrate ERNIE 4.5 and ERNIE X1 into its broader ecosystem, including Baidu Search and the Wenxiaoyan app.
  • Fello AI: Baidu’s New ERNIE 4.5 & X1 – A Free AI That Is Better Than GPT-4.5 & Costs Pennies!

Matthias Bastian@THE DECODER //
Mistral AI, a French artificial intelligence startup, has launched Mistral Small 3.1, a new open-source language model boasting 24 billion parameters. According to the company, this model outperforms similar offerings from Google and OpenAI, specifically Gemma 3 and GPT-4o Mini, while operating efficiently on consumer hardware like a single RTX 4090 GPU or a MacBook with 32GB RAM. It supports multimodal inputs, processing both text and images, and features an expanded context window of up to 128,000 tokens, which makes it suitable for long-form reasoning and document analysis.

Mistral Small 3.1 is released under the Apache 2.0 license, promoting accessibility and competition within the AI landscape. Mistral AI aims to challenge the dominance of major U.S. tech firms by offering a high-performance, cost-effective AI solution. The model achieves inference speeds of 150 tokens per second and is designed for text and multimodal understanding, positioning itself as a powerful alternative to industry-leading models without the need for expensive cloud infrastructure.

Recommended read:
References :
  • THE DECODER: Mistral launches improved Small 3.1 multimodal model
  • venturebeat.com: Mistral AI launches efficient open-source model that outperforms Google and OpenAI offerings with just 24 billion parameters, challenging U.S. tech giants' dominance in artificial intelligence.
  • Maginative: Mistral Small 3.1 Outperforms Gemma 3 and GPT-4o Mini
  • TestingCatalog: Mistral Small 3: A 24B open-source AI model optimized for speed
  • Simon Willison's Weblog: Mistral Small 3.1, an open-source AI model, delivers state-of-the-art performance.
  • SiliconANGLE: Paris-based artificial intelligence startup Mistral AI said today it’s open-sourcing a new, lightweight AI model called Mistral Small 3.1, claiming it surpasses the capabilities of similar models created by OpenAI and Google LLC.
  • Analytics Vidhya: Mistral Small 3.1: The Best Model in its Weight Class
  • Analytics Vidhya: Mistral 3.1 vs Gemma 3: Which is the Better Model?

Ryan Daws@AI News //
DeepSeek has released DeepSeek V3-0324, an upgraded version of their large language model, marking a significant milestone in open-source AI. According to Artificial Analysis, this new iteration is the highest-scoring non-reasoning model available, surpassing even proprietary counterparts from Google, Anthropic, and Meta. Its accessibility improves the AI research environment. Early reports indicate substantial improvements in reasoning and coding abilities, positioning it as a real rival to OpenAI's models.

The updated model, V3-0324, excels in benchmarks such as MMLU-Pro, GPQA, AIME, and LiveCodeBench, demonstrating enhanced problem-solving and knowledge retention. It runs at 20 tokens per second on a Mac Studio, showcasing its efficiency. With its MIT license, DeepSeek-V3-0324 is freely available for commercial use, and it can run directly on consumer-grade hardware. DeepSeek's advancements signal a shift in the AI sector, as open-source frameworks increasingly compete with closed systems, offering developers a powerful and adaptable tool.

Recommended read:
References :
  • The Algorithmic Bridge: This article mentions DeepSeek V3 update among other AI-related topics, providing a broader context for the DeepSeek release.
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek V3-0324 tops non-reasoning AI models in open-source first
  • Analytics Vidhya: DeepSeek V3 developed by the Chinese AI research lab DeepSeek under High-Flyer has been a standout in the AI landscape since its initial open-source release in December 2024.
  • www.techradar.com: DeepSeek releases upgraded AI model with better performance at lower costs.
  • SiliconANGLE: DeepSeek releases improved V3 model under MIT license
  • Quinta?s weblog: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI

Dashveenjit Kaur@AI News //
References: venturebeat.com , AI News , Sify ...
Chinese AI startup DeepSeek is shaking up the global technology landscape with its latest large language model, DeepSeek-V3-0324. This new model has been lauded for matching the performance of American AI models, while boasting significantly lower development costs. According to Lee Kai-fu, CEO of Chinese startup 01.AI, the gap between Chinese and American AI capabilities has narrowed dramatically, with China even ahead in some specific areas.

DeepSeek-V3-0324 features enhanced reasoning capabilities and improved performance in multiple benchmarks, particularly in mathematics. The model scored 59.4 on the American Invitational Mathematics Examination (AIME), a significant improvement over its predecessor. Häme University lecturer Kuittinen Petri noted DeepSeek's achievements were realized with just a fraction of the resources available to competitors like OpenAI. This breakthrough has been attributed to DeepSeek’s focus on algorithmic efficiency and novel approaches to model architecture, allowing them to overcome restrictions on access to the latest silicon.

This disruption is not going unnoticed, when DeepSeek launched its R1 model in January, America’s Nasdaq plunged 3.1%, while the S&P 500 fell 1.5%. While DeepSeek claimed a $5.6 million training cost, this represented only the marginal cost of the final training run. SemiAnalysis estimates DeepSeek's actual hardware investment at closer to $1.6 billion, with hundreds of millions in operating costs. The developments present opportunities and challenges for the.

Recommended read:
References :
  • venturebeat.com: DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI
  • AI News: DeepSeek disruption: Chinese AI innovation narrows global technology divide
  • GZERO Media: How DeepSeek changed China’s AI ambitions
  • Sify: DeepSeek’s AI Revolution: Creating an Entire AI Ecosystem
  • Nordic APIs: ChatGPT vs. DeepSeek: A Side-by-Side Comparison
  • Composio: Deepseek v3-0324 vs. Claude 3.7 Sonnet

mpesce@Windows Copilot News //
References: THE DECODER , Ken Yeung , www.itpro.com ...
Microsoft is actively enhancing its Copilot service with regular updates, introducing new features and improvements. Recent updates, as of March 19, 2025, include Copilot Vision which is now available on Android devices for Pro subscribers in the United States, allowing users to leverage their phone's camera for assistance. Copilot Vision access in Edge has expanded to all free users in the U.S., enabling the AI to scan, analyze, and provide insights from webpages. Additionally, the "Think Deeper" reasoning model has been upgraded to OpenAI's o3-mini (high), leading to faster and more flexible responses to complex questions.

Microsoft is also making strides in AI research and security. Microsoft Research has developed a more efficient method for incorporating external knowledge into language models, called KBLaM, which uses vector pairs woven directly into the model's architecture. This system offers faster and more efficient responses by processing knowledge directly within the model. Furthermore, Microsoft is expanding its Security Copilot service with new AI agents designed to help IT teams manage increasing security threats, with 11 agents being added in an April preview, designed to improve security outcomes at machine speed and scale.

Recommended read:
References :
  • THE DECODER: Microsoft Research has developed a more efficient way to incorporate external knowledge into language models.
  • Ken Yeung: Nathan Myhrvold had a front-row seat to the rise of modern computing that few can match. As Microsoft’s first CTO, he helped shape its technology strategy during its formative years from 1986 to 2000, working closely with Bill Gates and leading groundbreaking research efforts.
  • Windows Copilot News: Researchers act as attackers, sending an email that, when processed by the LLMail service based on a user query, will result in the bypass of the implemented prompt injection defenses, so that the LLM would execute a specific tool call.
  • www.itpro.com: Microsoft is expanding its Security Copilot service with new AI agents to help overworked IT teams deal with surging security threats.
  • Microsoft Copilot Blog: Welcome to Microsoft’s Copilot Release Notes. Here we’ll provide regular updates on what’s happening with Copilot, from new features to firmware updates and more.

@Google DeepMind Blog //
Researchers are making strides in understanding how AI models think. Anthropic has developed an "AI microscope" to peek into the internal processes of its Claude model, revealing how it plans ahead, even when generating poetry. This tool provides a limited view of how the AI processes information and reasons through complex tasks. The microscope suggests that Claude uses a language-independent internal representation, a "universal language of thought", for multilingual reasoning.

The team at Google DeepMind introduced JetFormer, a new Transformer designed to directly model raw data. This model, capable of both understanding and generating text and images seamlessly, maximizes the likelihood of raw data without depending on any pre-trained components. Additionally, a comprehensive benchmark called FACTS Grounding has been introduced to evaluate the factuality of large language models (LLMs). This benchmark measures how accurately LLMs ground their responses in provided source material and avoid hallucinations, aiming to improve trust and reliability in AI-generated information.

Recommended read:
References :
  • Google DeepMind Blog: FACTS Grounding: A new benchmark for evaluating the factuality of large language models
  • THE DECODER: Anthropic's AI microscope reveals how Claude plans ahead when generating poetry

Hassan Shittu@Fello AI //
References: Fello AI , lambdalabs.com
Nvidia is making significant strides in healthcare and AI infrastructure, particularly through the development of specialized large language models (LLMs). Their DNA LLM exemplifies this, aiming to revolutionize genomic research and drug discovery. This highlights AI's potential to transform medical science by enabling faster analysis and interpretation of biological data.

Lambda has been recognized as NVIDIA's 2025 Healthcare Partner of the Year for accelerating AI innovation in healthcare and biotech. John Snow Labs introduced the first commercially available Medical Reasoning LLM at NVIDIA GTC, optimized for clinical reasoning and capable of verbalizing its thought processes. Nvidia's involvement in this has helped lead the way for these healthcare specific Large Language Models.

Recommended read:
References :
  • Fello AI: NVIDIA DNA LLM: The Power To Curing All Diseases?
  • lambdalabs.com: This article discusses Lambda Honored to Accelerate AI Innovation in Healthcare with NVIDIA