News from the AI & ML world

DeeperML - #mathematics

Matt Marshall@AI News | VentureBeat //
Microsoft is enhancing its Copilot Studio platform with AI-driven improvements, introducing deep reasoning capabilities that enable agents to tackle intricate problems through methodical thinking and combining AI flexibility with deterministic business process automation. The company has also unveiled specialized deep reasoning agents for Microsoft 365 Copilot, named Researcher and Analyst, to help users achieve tasks more efficiently. These agents are designed to function like personal data scientists, processing diverse data sources and generating insights through code execution and visualization.

Microsoft's focus includes securing AI and using it to bolster security measures, as demonstrated by the upcoming Microsoft Security Copilot agents and new security features. Microsoft aims to provide an AI-first, end-to-end security platform that helps organizations secure their future, one example being the AI agents designed to autonomously assist with phishing, data security, and identity management. The Security Copilot tool will automate routine tasks, allowing IT and security staff to focus on more complex issues, aiding in defense against cyberattacks.

Recommended read:
References :
  • Microsoft Security Blog: Learn about the upcoming availability of Microsoft Security Copilot agents and other new offerings for a more secure AI future.
  • www.zdnet.com: Designed for Microsoft's Security Copilot tool, the AI-powered agents will automate basic tasks, freeing IT and security staff to tackle more complex issues.

Maximilian Schreiner@THE DECODER //
Google has unveiled Gemini 2.5 Pro, its latest and "most intelligent" AI model to date, showcasing significant advancements in reasoning, coding proficiency, and multimodal functionalities. According to Google, these improvements come from combining a significantly enhanced base model with improved post-training techniques. The model is designed to analyze complex information, incorporate contextual nuances, and draw logical conclusions with unprecedented accuracy. Gemini 2.5 Pro is now available for Gemini Advanced users and on Google's AI Studio.

Google emphasizes the model's "thinking" capabilities, achieved through chain-of-thought reasoning, which allows it to break down complex tasks into multiple steps and reason through them before responding. This new model can handle multimodal input from text, audio, images, videos, and large datasets. Additionally, Gemini 2.5 Pro exhibits strong performance in coding tasks, surpassing Gemini 2.0 in specific benchmarks and excelling at creating visually compelling web apps and agentic code applications. The model also achieved 18.8% on Humanity’s Last Exam, demonstrating its ability to handle complex knowledge-based questions.

Recommended read:
References :
  • SiliconANGLE: Google LLC said today it’s updating its flagship Gemini artificial intelligence model family by introducing an experimental Gemini 2.5 Pro version.
  • The Tech Basic: Google's New AI Models “Think” Before Answering, Outperform Rivals
  • AI News | VentureBeat: Google releases ‘most intelligent model to date,’ Gemini 2.5 Pro
  • Analytics Vidhya: We Tried the Google 2.5 Pro Experimental Model and It’s Mind-Blowing!
  • www.tomsguide.com: Google unveils Gemini 2.5 — claims AI breakthrough with enhanced reasoning and multimodal power
  • Google DeepMind Blog: Gemini 2.5: Our most intelligent AI model
  • THE DECODER: Google Deepmind has introduced Gemini 2.5 Pro, which the company describes as its most capable AI model to date. The article appeared first on .
  • intelligence-artificielle.developpez.com: Google DeepMind a lancé Gemini 2.5 Pro, un modèle d'IA qui raisonne avant de répondre, affirmant qu'il est le meilleur sur plusieurs critères de référence en matière de raisonnement et de codage
  • The Tech Portal: Google unveils Gemini 2.5, its most intelligent AI model yet with ‘built-in thinking’
  • Ars OpenForum: Google says the new Gemini 2.5 Pro model is its “smartest†AI yet
  • The Official Google Blog: Gemini 2.5: Our most intelligent AI model
  • www.techradar.com: I pitted Gemini 2.5 Pro against ChatGPT o3-mini to find out which AI reasoning model is best
  • bsky.app: Google's AI comeback is official. Gemini 2.5 Pro Experimental leads in benchmarks for coding, math, science, writing, instruction following, and more, ahead of OpenAI's o3-mini, OpenAI's GPT-4.5, Anthropic's Claude 3.7, xAI's Grok 3, and DeepSeek's R1. The narrative has finally shifted.
  • Shelly Palmer: Google’s Gemini 2.5: AI That Thinks Before It Speaks
  • bdtechtalks.com: Gemini 2.5 Pro is a new reasoning model that excels in long-context tasks and benchmarks, revitalizing Google’s AI strategy against competitors like OpenAI.
  • Interconnects: The end of a busy spring of model improvements and what's next for the presumed leader in AI abilities.
  • www.techradar.com: Gemini 2.5 is now available for Advanced users and it seriously improves Google’s AI reasoning
  • www.zdnet.com: Google releases 'most intelligent' experimental Gemini 2.5 Pro - here's how to try it
  • Unite.AI: Gemini 2.5 Pro is Here—And it Changes the AI Game (Again)
  • TestingCatalog: Gemini 2.5 Pro sets new AI benchmark and launches on AI Studio and Gemini
  • Analytics Vidhya: Google DeepMind's latest AI model, Gemini 2.5 Pro, has reached the #1 position on the Arena leaderboard.
  • AI News: Gemini 2.5: Google cooks up its ‘most intelligent’ AI model to date
  • Fello AI: Google’s Gemini 2.5 Shocks the World: Crushing AI Benchmark Like No Other AI Model!
  • Analytics India Magazine: Google Unveils Gemini 2.5, Crushes OpenAI GPT-4.5, DeepSeek R1, & Claude 3.7 Sonnet
  • Practical Technology: Practical Tech covers the launch of Google's Gemini 2.5 Pro and its new AI benchmark achievements.
  • Shelly Palmer: Google's Gemini 2.5: AI That Thinks Before It Speaks
  • www.producthunt.com: Google's most intelligent AI model
  • Windows Copilot News: Google reveals AI ‘reasoning’ model that ‘explicitly shows its thoughts’
  • AI News | VentureBeat: Hands on with Gemini 2.5 Pro: why it might be the most useful reasoning model yet
  • thezvi.wordpress.com: Gemini 2.5 Pro Experimental is America’s next top large language model. That doesn’t mean it is the best model for everything. In particular, it’s still Gemini, so it still is a proud member of the Fun Police, in terms of …
  • www.computerworld.com: Gemini 2.5 can, among other things, analyze information, draw logical conclusions, take context into account, and make informed decisions.
  • www.infoworld.com: Google introduces Gemini 2.5 reasoning models
  • Maginative: Google's Gemini 2.5 Pro leads AI benchmarks with enhanced reasoning capabilities, positioning it ahead of competing models from OpenAI and others.
  • www.infoq.com: Google's Gemini 2.5 Pro is a powerful new AI model that's quickly becoming a favorite among developers and researchers. It's capable of advanced reasoning and excels in complex tasks.
  • AI News | VentureBeat: Google’s Gemini 2.5 Pro is the smartest model you’re not using – and 4 reasons it matters for enterprise AI
  • Communications of the ACM: Google has released Gemini 2.5 Pro, an updated AI model focused on enhanced reasoning, code generation, and multimodal processing.
  • The Next Web: Google has released Gemini 2.5 Pro, an updated AI model focused on enhanced reasoning, code generation, and multimodal processing.
  • www.tomsguide.com: Gemini 2.5 Pro is now free to all users in surprise move
  • Composio: Google just launched Gemini 2.5 Pro on March 26th, claiming to be the best in coding, reasoning and overall everything. But I The post appeared first on .
  • Composio: Google's Gemini 2.5 Pro, released on March 26th, is being hailed for its enhanced reasoning, coding, and multimodal capabilities.
  • Analytics India Magazine: Gemini 2.5 Pro is better than the Claude 3.7 Sonnet for coding in the Aider Polyglot leaderboard.
  • www.zdnet.com: Gemini's latest model outperforms OpenAI's o3 mini and Anthropic's Claude 3.7 Sonnet on the latest benchmarks. Here's how to try it.
  • www.marketingaiinstitute.com: [The AI Show Episode 142]: ChatGPT’s New Image Generator, Studio Ghibli Craze and Backlash, Gemini 2.5, OpenAI Academy, 4o Updates, Vibe Marketing & xAI Acquires X
  • www.tomsguide.com: Gemini 2.5 is free, but can it beat DeepSeek?
  • www.tomsguide.com: Google Gemini could soon help your kids with their homework — here’s what we know
  • PCWorld: Google’s latest Gemini 2.5 Pro AI model is now free for all users
  • www.techradar.com: Google just made Gemini 2.5 Pro Experimental free for everyone, and that's awesome.
  • Last Week in AI: #205 - Gemini 2.5, ChatGPT Image Gen, Thoughts of LLMs

Michael Weiss@Diagonal Argument //
References: Diagonal Argument
Recent discussions in mathematical concepts and programming tools cover a range of topics, including theoretical foundations and practical applications. Peter Cameron highlighted the Compactness Theorem for first-order logic, explaining its consequences and connections to topology. Also, a beginner's guide to sets has been published to explain how they work and some applications.

Noel Welsh presented a talk at Imperial College on dualities in programming, exploring the relationships between data and codata, calls and returns, and ASTs and stack machines. The use of adjoints in boolean operations was justified, and Daniel Lemire published an overview of parallel programming using Go. These discussions bridge the gap between abstract mathematical principles and their concrete uses in software development and programming paradigms.

Recommended read:
References :
  • Diagonal Argument: Some equations and inequalities for adjoints: they preserve some boolean operations, and “half-preserveâ€� some others.

msaul@mathvoices.ams.org //
Researchers at the Technical University of Munich (TUM) and the University of Cologne have developed an AI-based learning system designed to provide individualized support for schoolchildren in mathematics. The system utilizes eye-tracking technology via a standard webcam to identify students’ strengths and weaknesses. By monitoring eye movements, the AI can pinpoint areas where students struggle, displaying the data on a heatmap with red indicating frequent focus and green representing areas glanced over briefly.

This AI-driven approach allows teachers to provide more targeted assistance, improving the efficiency and personalization of math education. The software classifies the eye movement patterns and selects appropriate learning videos and exercises for each pupil. Professor Maike Schindler from the University of Cologne, who has collaborated with TUM Professor Achim Lilienthal for ten years, emphasizes that this system is completely new, tracking eye movements, recognizing learning strategies via patterns, offering individual support, and creating automated support reports for teachers.

Recommended read:
References :
  • www.sciencedaily.com: Researchers have developed an AI-based learning system that recognizes strengths and weaknesses in mathematics by tracking eye movements with a webcam to generate problem-solving hints. This enables teachers to provide significantly more children with individualized support.
  • phys.org: Researchers at the Technical University of Munich (TUM) and the University of Cologne have developed an AI-based learning system that recognizes strengths and weaknesses in mathematics by tracking eye movements with a webcam to generate problem-solving hints.
  • medium.com: Artificial Intelligence Math: How AI is Revolutionizing Math Learning
  • medium.com: Exploring AI Math Master Applications: Enhancing Mathematics Learning with Artificial Intelligence
  • phys.org: AI-based math: Individualized support for students uses eye tracking

@phys.org //
Researchers at the Technical University of Munich (TUM) and the University of Cologne have developed an AI-based learning system designed to provide individualized support for schoolchildren in mathematics. The system utilizes eye-tracking technology via a standard webcam to identify students’ strengths and weaknesses. By monitoring eye movements, the AI can pinpoint areas where students struggle, displaying the data on a heatmap with red indicating frequent focus and green representing areas glanced over briefly.

This AI-driven approach allows teachers to provide more targeted assistance, improving the efficiency and personalization of math education. The software classifies the eye movement patterns and selects appropriate learning videos and exercises for each pupil. Professor Maike Schindler from the University of Cologne, who has collaborated with TUM Professor Achim Lilienthal for ten years, emphasizes that this system is completely new, tracking eye movements, recognizing learning strategies via patterns, offering individual support, and creating automated support reports for teachers.

Recommended read:
References :
  • phys.org: AI-based math: Individualized support for students uses eye tracking
  • www.sciencedaily.com: AI-based math: Individualized support for schoolchildren

vishnupriyan@Verdict //
Google's AI mathematics system, known as AlphaGeometry2 (AG2), has surpassed the problem-solving capabilities of International Mathematical Olympiad (IMO) gold medalists in solving complex geometry problems. This second-generation system combines a language model with a symbolic engine, enabling it to solve 84% of IMO geometry problems, compared to the 81.8% solved by human gold medalists. Developed by Google DeepMind, AG2 can engage in both pattern matching and creative problem-solving, marking a significant advancement in AI's ability to mimic human reasoning in mathematics.

This achievement comes shortly after Microsoft released its own advanced AI math reasoning system, rStar-Math, highlighting the growing competition in the AI math domain. While rStar-Math uses smaller language models to solve a broader range of problems, AG2 focuses on advanced geometry problems using a hybrid reasoning model. The improvements in AG2 represent a 30% performance increase over the original AlphaGeometry, particularly in visual reasoning and logic, essential for solving complex geometry challenges.

Recommended read:
References :
  • Shelly Palmer: Google’s Veo 2 at 50 Cents a Second: Priced Right—for Now
  • www.livescience.com: 'Math Olympics' has a new contender — Google's AI now 'better than human gold medalists' at solving geometry problems
  • Verdict: Google expands Deep Research tool for workspace users
  • www.sciencedaily.com: Google's second generation of its AI mathematics system combines a language model with a symbolic engine to solve complex geometry problems better than International Mathematical Olympiad (IMO) gold medalists.

@artsci.washington.edu //
University of Washington professors Xiaodong Xu, Cynthia Vinzant, and Shayan Oveis Gharan have been honored by the National Academy of Sciences (NAS) for their outstanding research achievements. The NAS awards program has been recognizing outstanding achievement in the physical, biological, and social sciences since 1866. The annual awards ceremony will honor the major contributions made by 20 researchers.

Xu received the NAS Award for Scientific Discovery for his experimental observation of the fractional quantum anomalous Hall effect. This award, presented every two years, recognizes an accomplishment or discovery in basic research within the previous five years that is expected to have a significant impact on astronomy, biochemistry, biophysics, chemistry, materials science, or physics. Xu's research explores new quantum phenomena in layered two-dimensional materials and engineered quantum systems.

Vinzant and Oveis Gharan, along with Nima Anari and Kuikui Liu, will receive the Michael and Sheila Held Prize for breakthrough work advancing the theory of matroids and mixing rates of Markov chains. The Michael and Sheila Held Prize is presented annually to honor outstanding, innovative, creative, and influential research in the areas of combinatorial and discrete optimization, or related parts of computer science, such as the design and analysis of algorithms and complexity theory. This $100,000 prize is intended to recognize recent work.

Recommended read:
References :
  • Recent News: This news article is about the NAS awards for Xu, Vinzant, and Oveis Gharan.
  • artsci.washington.edu: This page from UW describes the NAS awards for Xu, Vinzant, and Oveis Gharan.

@techcrunch.com //
DeepMind's artificial intelligence, AlphaGeometry2, has achieved a remarkable feat by solving 84% of the geometry problems from the International Mathematical Olympiad (IMO) over the past 25 years. This performance surpasses the average gold medalist in the prestigious competition for gifted high school students. The AI's success highlights the growing capabilities of AI in handling sophisticated mathematical tasks.

AlphaGeometry2 represents an upgraded system from DeepMind, incorporating advancements such as the integration of Google's Gemini large language model and the ability to reason by manipulating geometric objects. This neuro-symbolic system combines a specialized language model with abstract reasoning coded by humans, enabling it to generate rigorous proofs and avoid common AI pitfalls like hallucinations. This could potentially impact fields that heavily rely on mathematical expertise.

Recommended read:
References :
  • www.nature.com: This news report discusses DeepMind's AI achieving performance comparable to top human solvers in mathematics.
  • techcrunch.com: DeepMind says its AlphaGeometry2 model solved 84% of International Math Olympiad's geometry problems from the last 25 years, surpassing average gold medalists
  • Techmeme: DeepMind says its AlphaGeometry2 model solved 84% of International Math Olympiad's geometry problems from the last 25 years, surpassing average gold medalists (Kyle Wiggers/TechCrunch)
  • techxplore.com: TechXplore reports on DeepMind AI achieves gold-medal level performance on challenging Olympiad math questions.
  • www.analyticsvidhya.com: DeepMind’s AlphaGeometry2 Surpasses Math Olympiad
  • www.marktechpost.com: Marktechpost discusses Google DeepMind's AlphaGeometry2.