News from the AI & ML world

DeeperML

@Google DeepMind Blog //
Google is pushing the boundaries of AI and robotics with its Gemini AI models. Gemini Robotics, an advanced vision-language-action model, now enables robots to perform physical tasks with improved generalization, adaptability, and dexterity. This model interprets and acts on text, voice, and image data, showcasing Google's advancements in integrating AI for practical applications. Furthermore, the development of Gemini Robotics-ER, which incorporates embodied reasoning capabilities, signifies another step toward smarter, more adaptable robots.

Google's approach to robotics emphasizes safety, employing both physical and semantic safety systems. The company is inviting filmmakers and creators to experiment with the model to improve the design and development. Veo builds on years of generative video model work, including Generative Query Network(GQN),DVD-GAN,Imagen-Video,Phenaki,WALT,VideoPoetandLumiere— combining architecture, scaling laws and other novel techniques to improve quality and output resolution.
Original img attribution: https://lh3.googleusercontent.com/J74rVi68EPPNMBLxhxI76Bli7QggLtYRYfp5Pk2HVPtSt2NIIk2VmLktQbwDZeIlZiW3AHwlpLNcswHuz_ecR-oj4kI-mtF53yYsGJKfvPugAw5ulQ=w1200-h630-n-nu
ImgSrc: lh3.googleuserc

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • Google DeepMind Blog: Gemini Robotics brings AI into the physical world
  • Maginative: Google DeepMind Unveils Gemini Robotics Models to Bridge AI and Physical World
  • IEEE Spectrum: With Gemini Robotics, Google Aims for Smarter Robots
  • The Official Google Blog: Take a closer look at our new Gemini models for robotics.
  • THE DECODER: Google Deepmind unveils new AI models for robotic control
  • www.tomsguide.com: Google is putting it's Gemini 2.0 AI into robots — here's how it's going
  • Verdict: Google DeepMind unveils Gemini AI models for robotics
  • MarkTechPost: Google DeepMind’s Gemini Robotics: Unleashing Embodied AI with Zero-Shot Control and Enhanced Spatial Reasoning
  • LearnAI: Research Published 12 March 2025 Authors Carolina Parada Introducing Gemini Robotics, our Gemini 2.0-based model designed for robotics At Google DeepMind, we’ve been making progress in how our Gemini models solve complex problems through multimodal reasoning across text, images, audio and video. So far however, those abilities have been largely confined to the digital realm....
  • OODAloop: Google DeepMind unveils new AI models for robotic control.
  • www.producthunt.com: Gemini Robotics
  • Last Week in AI: Last Week in AI #303 - Gemini Robotics, Gemma 3, CSM-1B
  • Windows Copilot News: Google is prepping Gemini to take action inside of apps
  • Last Week in AI: Discusses Gemini Robotics in the context of general AI agents and robotics.
  • www.infoq.com: Google DeepMind unveils Gemini Robotics, an advanced AI model for enhancing robotics through vision, language, and action.
  • AI & Machine Learning: This article discusses how generative AI is poised to revolutionize multiplayer games, offering personalized experiences through dynamic narratives and environments. The article specifically mentions Google's Gemini AI model and its potential to enhance gameplay.
  • Gradient Flow: This podcast episode discusses various advancements in AI, including Google's Gemini Robotics and Gemma 3, as well as the evolving regulatory landscape across different countries.
  • Insight Partners: This article highlights Andrew Ng's keynote at ScaleUp:AI '24, where he discusses the exciting trends in AI agents and applications, mentioning Google's Gemini AI assistant and its role in driving innovation.
  • www.tomsguide.com: You can now use Google Gemini without an account — here's how to get started
Classification: