@Google DeepMind Blog
// 24d
Google is pushing the boundaries of AI and robotics with its Gemini AI models. Gemini Robotics, an advanced vision-language-action model, now enables robots to perform physical tasks with improved generalization, adaptability, and dexterity. This model interprets and acts on text, voice, and image data, showcasing Google's advancements in integrating AI for practical applications. Furthermore, the development of Gemini Robotics-ER, which incorporates embodied reasoning capabilities, signifies another step toward smarter, more adaptable robots.
Google's approach to robotics emphasizes safety, employing both physical and semantic safety systems. The company is inviting filmmakers and creators to experiment with the model to improve its design and development. Veo builds on years of generative video model work, including Generative Query Network (GQN), DVD-GAN, Imagen-Video, Phenaki, WALT, VideoPoet and Lumiere, combining architecture, scaling laws and other novel techniques to improve quality and output resolution.
Maria Deutscher@AI - SiliconANGLE
// 5d
Isomorphic Labs, an Alphabet spinout focused on AI-driven drug design, has secured $600 million in its first external funding round. The investment, led by Thrive Capital with participation from Alphabet and GV, will fuel the advancement of Isomorphic Labs' AI drug design engine and therapeutic programs. The company aims to leverage artificial intelligence, including its AlphaFold technology, to revolutionize drug discovery across various therapeutic areas, including oncology and immunology. This funding is expected to accelerate research and development efforts, as well as facilitate the expansion of Isomorphic Labs' team with top-tier talent.
Isomorphic Labs, founded in 2021 by Sir Demis Hassabis, seeks to reimagine and accelerate drug discovery by applying AI. Its AI-powered engine streamlines the design of small molecules with therapeutic applications and can predict how effectively a small molecule attaches to a protein. The company's software also eases other aspects of the drug development workflow. Isomorphic Labs has already established collaborations with pharmaceutical companies like Eli Lilly and Novartis, and the new funding will support the progression of its own drug programs into clinical development.
@phys.org
// 22h
Google's DeepMind has achieved a significant breakthrough in artificial intelligence with its Dreamer AI system. The AI has successfully mastered the complex task of mining diamonds in Minecraft without any explicit human instruction. This feat, accomplished through trial-and-error reinforcement learning, demonstrates the AI's ability to self-improve and generalize knowledge from one scenario to another, mimicking human-like learning processes. The achievement is particularly noteworthy because Minecraft's randomly generated worlds present a unique challenge, requiring the AI to adapt and understand its environment rather than relying on memorized strategies.
Mining diamonds in Minecraft is a complex, multi-step process that typically requires players to gather resources to build tools, dig to specific depths, and avoid hazards like lava. The Dreamer AI system tackled this challenge by exploring the game environment and identifying actions that would lead to rewards, such as finding diamonds. By repeating successful actions and avoiding less productive ones, the AI quickly learned to navigate the game and achieve its goal. According to Jeff Clune, a computer scientist at the University of British Columbia, this represents a major step forward for the field of AI.
The Dreamer AI system, developed by Danijar Hafner, Jurgis Pasukonis, Timothy Lillicrap and Jimmy Ba, achieved expert status in Minecraft in just nine days, showcasing its rapid learning capabilities. One unique approach used during training was to restart the game with a new virtual universe every 30 minutes, forcing the algorithm to constantly adapt and improve. This method allowed the AI to quickly master the game's mechanics and develop strategies for diamond mining without any prior training or human intervention, pushing the boundaries of what AI can achieve in dynamic and complex environments.
Nitika Sharma@Analytics Vidhya
// 1d
Google's DeepMind has achieved a significant milestone in artificial intelligence by developing an AI system, named Dreamer, that has mastered Minecraft without any human instruction or data. The Dreamer AI system successfully learned how to mine diamonds, a complex and multi-step process, entirely on its own through trial and error. This breakthrough highlights the potential for AI systems to generalize knowledge and transfer skills from one domain to another, marking a major step forward in the field of AI development.
Researchers set up the Dreamer AI to play Minecraft with a system of rewards, particularly for finding diamonds. The AI explores the game on its own, identifies actions that lead to in-game rewards, and repeats them. It reached an expert level within just nine days. The results suggest that AI systems can learn to improve their abilities over a short period of time, which could give robots the tools they need to perform well in the real world.
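The reward-driven trial and error described above can be illustrated with a minimal sketch. It uses tabular Q-learning on a made-up five-state chain, which is an assumption chosen for brevity: it is not the Dreamer algorithm, and the environment, function names and hyperparameters are all hypothetical.

```python
import random

# Toy stand-in for a multi-step task: states 0..4 on a line, reward only at the end.
# Hypothetical environment; it is not Minecraft and this is not the Dreamer algorithm.
N_STATES = 5
ACTIONS = (-1, +1)  # step left or step right


def step(state, action):
    """Move along the chain; reaching the last state pays 1.0 and ends the episode."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Trial and error: mostly repeat the best known action, sometimes explore.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            nxt, reward, done = step(state, ACTIONS[a])
            # Move the estimate toward reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
    return q


def greedy_policy(q):
    """Best action per non-terminal state after training."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

After training, the greedy policy should step right from every state, because only that direction ever leads to the reward.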
@Google DeepMind Blog
// 2d
References: Google DeepMind Blog, The Next Web
Google DeepMind is intensifying its focus on AI governance and security as it ventures further into artificial general intelligence (AGI). The company is exploring AI monitors to regulate hyperintelligent AI models, splitting potential threats into four categories, with the creation of a "monitor" AI being one proposed solution. This proactive approach includes prioritizing technical safety, conducting thorough risk assessments, and fostering collaboration within the broader AI community to navigate the development of AGI responsibly.
DeepMind's reported clampdown on sharing research will stifle AI innovation, warns Anita Schjøll Abildgaard, CEO of Iris.ai, one of Europe's leading startups in the space. Concerns are rising within the AI community that the new research restrictions threaten AI innovation. Abildgaard, whose Norwegian startup is developing an AI-powered engine for science, warns that the drawbacks will far outweigh the benefits and fears the restrictions will hinder technological advances.
@techcrunch.com
// 57d
DeepMind's artificial intelligence, AlphaGeometry2, has achieved a remarkable feat by solving 84% of the geometry problems from the International Mathematical Olympiad (IMO) over the past 25 years. This performance surpasses the average gold medalist in the prestigious competition for gifted high school students. The AI's success highlights the growing capabilities of AI in handling sophisticated mathematical tasks.
AlphaGeometry2 represents an upgraded system from DeepMind, incorporating advancements such as the integration of Google's Gemini large language model and the ability to reason by manipulating geometric objects. This neuro-symbolic system combines a specialized language model with abstract reasoning coded by humans, enabling it to generate rigorous proofs and avoid common AI pitfalls like hallucinations. This could affect fields that rely heavily on mathematical expertise.
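As a rough illustration of the neuro-symbolic pattern described above, the sketch below pairs a rule-based deduction engine with a stubbed "proposer" that suggests auxiliary facts when deduction gets stuck. Everything here is an assumption for illustration: the facts are opaque strings, the `proposals` list merely stands in for a language model, and none of it reflects AlphaGeometry2's actual domain language or search algorithm.

```python
def deduce(facts, rules):
    """Forward chaining: apply rules until no new fact appears (the symbolic engine)."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and all(p in known for p in premises):
                known.add(conclusion)
                changed = True
    return known


def solve(facts, rules, goal, proposals):
    """Try pure deduction first; when stuck, add one proposed auxiliary fact and retry.

    `proposals` is a hypothetical stub standing in for a language model's suggestions.
    """
    added = []
    known = deduce(facts, rules)
    for aux in proposals:
        if goal in known:
            break
        added.append(aux)
        known = deduce(known | {aux}, rules)
    return goal in known, added


# Hypothetical toy problem written as opaque strings, not a real geometry language.
RULES = [
    (("AB=AC",), "triangle ABC is isosceles"),
    (("triangle ABC is isosceles", "M is midpoint of BC"), "AM is perpendicular to BC"),
]
FACTS = {"AB=AC"}
GOAL = "AM is perpendicular to BC"
PROPOSALS = ["D is a point on AB", "M is midpoint of BC"]

solved, used = solve(FACTS, RULES, GOAL, PROPOSALS)
print(solved, used)
```

Because every conclusion comes from an explicit rule, a wrong or useless proposal (the first one here) cannot produce an invalid proof; it simply fails to help.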
@Google DeepMind Blog
// 2d
Google DeepMind has released a strategy paper outlining its approach to the development of safe artificial general intelligence (AGI). According to DeepMind, AGI, defined as AI capable of matching or exceeding human cognitive abilities, could emerge as early as 2030. The company emphasizes the importance of proactive risk assessment, technical safety measures, and collaboration within the AI community to ensure responsible development. They are exploring the frontiers of AGI, prioritizing readiness and identifying potential challenges and benefits.
DeepMind's strategy identifies four key risk areas: misuse, misalignment, accidents, and structural risks, with an initial focus on misuse and misalignment. Misuse refers to the intentional use of AI systems for harmful purposes, such as spreading disinformation. DeepMind is also introducing Gemini Robotics, which it touts as its most advanced vision-language-action model. Gemini Robotics aims to allow robots to comprehend something in front of them, interact with a user, and take action.
@www.marktechpost.com
// 54d
DeepMind's AlphaGeometry2, an AI system, has achieved a remarkable milestone by surpassing the average performance of gold medalists in the International Mathematical Olympiad (IMO) geometry problems. This significant upgrade to the original AlphaGeometry demonstrates the potential of AI in tackling complex mathematical challenges that require both high-level reasoning and strategic problem-solving abilities. The system leverages advanced AI techniques to solve these intricate geometry problems, marking a notable advancement in AI's capabilities.
Researchers from Google DeepMind, alongside collaborators from the University of Cambridge, Georgia Tech, and Brown University, enhanced the system with a Gemini-based language model, a more efficient symbolic engine, and a novel search algorithm with knowledge sharing. These improvements have significantly boosted its problem-solving rate to 84% on IMO geometry problems from 2000-2024. AlphaGeometry2 represents a step towards a fully automated system capable of interpreting problems from natural language and devising solutions, underscoring AI's growing potential in fields demanding high mathematical reasoning skills, such as research and education.
Synced@Synced
// 6d
References: Synced, www.theguardian.com
DeepMind has announced significant advancements in AI modeling and biomedicine, pushing the boundaries of what's possible with artificial intelligence. The company's research is focused on creating more effective drugs and medicine, as well as understanding and protecting species around the world.
DeepMind's JetFormer, a novel Transformer model, is designed to directly model raw data, eliminating the need for pre-trained components. JetFormer can understand and generate both text and images seamlessly. This model leverages normalizing flows to encode images into a latent representation, enhancing the focus on essential high-level information through progressive Gaussian noise augmentation. JetFormer has demonstrated competitive performance in image generation and web-scale multimodal generation tasks. Additionally, DeepMind is exploring how studying honeybee immunity could offer insights into protecting various species. The company's AlphaFold continues to revolutionize biology, aiding in the design of more effective drugs. AlphaFold, which uses AI to determine a protein's structure, has been used to answer fundamental questions in biology, was recognised with the Nobel Prize in Chemistry (awarded to Demis Hassabis and John Jumper), and has revolutionised drug discovery. The AlphaFold database holds approximately 250,000,000 protein structures and has been used by almost 2 million people from 190 countries.
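For readers unfamiliar with normalizing flows, the sketch below shows the core idea on two-dimensional inputs: an invertible transform with a cheap Jacobian determinant, which gives an exact log-likelihood. It is a single affine coupling layer with a fixed, hand-written `scale_and_shift` function (a hypothetical stand-in for a learned network) and makes no claim about JetFormer's actual architecture.

```python
import math


def scale_and_shift(x1):
    """Fixed, hypothetical stand-in for the small network a real flow would learn."""
    return math.tanh(x1), 0.5 * x1  # (log-scale s, shift t)


def forward(x):
    """Map data (x1, x2) to latent (z1, z2) and return log|det Jacobian|."""
    x1, x2 = x
    s, t = scale_and_shift(x1)
    # x1 passes through unchanged, so the Jacobian is triangular with log-det s.
    return (x1, x2 * math.exp(s) + t), s


def inverse(z):
    """Exactly undo forward(); possible because z1 == x1 lets us recompute s and t."""
    z1, z2 = z
    s, t = scale_and_shift(z1)
    return (z1, (z2 - t) * math.exp(-s))


def log_prob(x):
    """Exact log-density of x under a standard normal prior on the latent."""
    z, log_det = forward(x)
    log_prior = sum(-0.5 * v * v - 0.5 * math.log(2 * math.pi) for v in z)
    return log_prior + log_det  # change-of-variables formula


x = (0.3, -1.2)
z, _ = forward(x)
x_back = inverse(z)
print(z, x_back, log_prob(x))
```

Stacking many such layers, with learned scale and shift functions, is what lets a flow model turn raw inputs into a latent representation while keeping the likelihood tractable.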
@www.infoq.com
// 4d
References: AI - SiliconANGLE, www.infoq.com
Google DeepMind has unveiled TxGemma, an AI designed to improve drug discovery and clinical trial predictions. TxGemma, built upon the Gemma model family, aims to streamline the drug development process and accelerate the creation of new treatments. This announcement comes as Isomorphic Labs, an Alphabet spinout, secured $600 million in funding to further develop its AI drug design engine, which reduces manual labor and speeds up drug development.
Isomorphic Labs' engine uses AI to design small molecules with therapeutic applications, predicting their effectiveness in attaching to disease-causing proteins and mapping properties like solubility. This is powered by models like Google's AlphaFold 3, which can predict the shape of proteins, DNA, and RNA, crucial for drug development. The funding will accelerate Isomorphic Labs' research and development efforts, expand its team, and advance its programs, including those focused on oncology and immunology, toward clinical development.
@Google DeepMind Blog
// 7d
References: Google DeepMind Blog, THE DECODER
Researchers are making strides in understanding how AI models think. Anthropic has developed an "AI microscope" to peek into the internal processes of its Claude model, revealing how it plans ahead, even when generating poetry. This tool provides a limited view of how the AI processes information and reasons through complex tasks. The microscope suggests that Claude uses a language-independent internal representation, a "universal language of thought", for multilingual reasoning.
The team at Google DeepMind introduced JetFormer, a new Transformer designed to directly model raw data. This model, capable of both understanding and generating text and images seamlessly, maximizes the likelihood of raw data without depending on any pre-trained components. Additionally, a comprehensive benchmark called FACTS Grounding has been introduced to evaluate the factuality of large language models (LLMs). This benchmark measures how accurately LLMs ground their responses in provided source material and avoid hallucinations, aiming to improve trust and reliability in AI-generated information.
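To make the idea of grounding a response in source material concrete, here is a deliberately crude sketch: it scores a response by the fraction of its sentences whose content words all appear in the source. This lexical check is purely an illustrative assumption; it is not how FACTS Grounding is scored, and the function names, stopword list and example texts are hypothetical.

```python
import re

# Small, hypothetical stopword list; a real system would need far more care.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "or", "to",
             "is", "are", "was", "were", "it", "by"}


def content_words(text):
    """Lowercased alphanumeric tokens with stopwords removed."""
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS}


def grounding_score(source, response):
    """Fraction of response sentences whose content words all occur in the source."""
    source_words = content_words(source)
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", response.strip()) if s]
    if not sentences:
        return 0.0
    supported = sum(1 for s in sentences if content_words(s) <= source_words)
    return supported / len(sentences)


SOURCE = "The bridge opened in 1932. It spans the harbour and carries eight lanes of traffic."
RESPONSE = "The bridge opened in 1932. It carries ten lanes of traffic."
score = grounding_score(SOURCE, RESPONSE)
print(score)
```

The second response sentence introduces "ten", which the source never mentions, so only one of the two sentences counts as supported. A word-overlap test like this misses paraphrases and negations, which is why serious benchmarks rely on much stronger judges.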
@boards.greenhouse.io
// 49d
DeepMind has launched a short course on AGI (Artificial General Intelligence) safety, targeting students, researchers, and professionals interested in the field. The course offers an accessible introduction to AI alignment, comprising short recorded talks and exercises totaling 75 minutes, complemented by an accompanying slide deck and exercise workbook. It addresses anticipated alignment challenges as AI capabilities advance and outlines DeepMind's current technical and governance approaches to mitigate these problems.
Key topics covered in the course include evidence suggesting the field is progressing toward advanced AI capabilities and arguments for instrumental subgoals and deliberate planning as potential sources of risk. It also differentiates between specification gaming and goal misgeneralization as ways misaligned goals can arise. The course delves into DeepMind's technical approach to AI alignment, emphasizing informed oversight and frontier safety practices such as dangerous capability evaluations, alongside institutional approaches to AI safety. If the course inspires you, you can apply to work with DeepMind in Research Scientist roles.
@www.analyticsvidhya.com
// 53d
References: techxplore.com, www.analyticsvidhya.com
DeepMind has unveiled AlphaGeometry2, a significant upgrade to its AlphaGeometry system. This new iteration achieves gold-medal level performance in solving challenging Olympiad geometry problems, surpassing the abilities of the average gold medalist. Researchers from Google DeepMind, along with collaborators from the University of Cambridge, Georgia Tech, and Brown University, enhanced the system's domain language, enabling it to handle more complex geometric concepts and increasing its coverage of IMO problems from 66% to 88%.
AlphaGeometry2 integrates a Gemini-based language model with a more efficient symbolic engine and a novel search algorithm. These improvements boost its solving rate to 84% on IMO geometry problems from 2000-2024. The work is a step toward a fully automated system that interprets problems from natural language. Prior research suggests that AI capable of solving geometry problems could lead to more sophisticated applications, since such problems require both a high level of reasoning ability and the ability to choose among possible steps in working toward a solution.
Nathan Labenz@The Cognitive Revolution
// 19d
References: Google DeepMind Blog, Windows Copilot News
DeepMind's Allan Dafoe, Director of Frontier Safety and Governance, is actively involved in shaping the future of AI governance. Dafoe is addressing the challenges of evaluating AI capabilities, understanding structural risks, and navigating the complexities of governing AI technologies. His work focuses on ensuring AI's responsible development and deployment, especially as AI transforms sectors like education, healthcare, and sustainability, while mitigating potential risks through necessary safety measures.
Google is also prepping its Gemini AI model to take actions within apps, potentially revolutionizing how users interact with their devices. This development, which involves a new API in Android 16 called "app functions," aims to give Gemini agent-like abilities to perform tasks inside applications. For example, users might be able to order food from a local restaurant using Gemini without directly opening the restaurant's app. This capability could make AI assistants significantly more useful.
Ben Lorica@Gradient Flow
// 27d
References: Gradient Flow
DeepSeek is making significant strides in the AI landscape, particularly within the healthcare sector in China. The AI solution is being rapidly adopted across China's tertiary hospitals to improve clinical decision-making and operational efficiency. Its rollout began in Shanghai, with hospitals like Fudan University Affiliated Huashan Hospital, and has expanded nationwide. DeepSeek is being used in areas such as intelligent pathology to automate tumor analysis, imaging analysis for lung nodule differentiation, clinical decision support for evidence retrieval, and workflow optimization to reduce patient wait times.
DeepSeek has also open-sourced several code repositories, a move toward transparency that advances the wider AI community and unsettles competitors. It puts the firm ahead of the competition on model transparency, and the open-source nature allows hospitals to customize the programs. This level of openness goes further than that of competitors such as Meta's Llama, which has only open-sourced the weights of its models. DeepSeek's deployment focuses on practical applications within hospital intranets, ensuring data security while improving accuracy and generalization through hierarchical knowledge distillation, reducing computational costs.
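The text above mentions hierarchical knowledge distillation without giving details. As background only, the sketch below shows the standard single-level distillation loss, in which a small student model is trained to match a larger teacher's temperature-softened output distribution. The logits, temperature and function names are hypothetical, and nothing here describes DeepSeek's or any hospital's actual setup.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax; higher temperature gives a softer distribution."""
    scaled = [v / temperature for v in logits]
    m = max(scaled)
    exps = [math.exp(v - m) for v in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened outputs, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)
    return temperature ** 2 * kl


teacher = [4.0, 1.0, 0.5]       # hypothetical logits from a large model
good_student = [3.5, 1.2, 0.4]  # close to the teacher
poor_student = [0.5, 1.0, 4.0]  # disagrees with the teacher
good_loss = distillation_loss(teacher, good_student)
poor_loss = distillation_loss(teacher, poor_student)
print(good_loss, poor_loss)
```

A student that agrees with the teacher gets a loss near zero, so minimizing this quantity transfers the teacher's behavior into a model that is cheaper to run.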