News from the AI & ML world

DeeperML - #datascience

Kuldeep Jha@Verdict //
Databricks has unveiled Agent Bricks, a new tool designed to streamline the development and deployment of enterprise AI agents. Built on Databricks' Mosaic AI platform, Agent Bricks automates the optimization and evaluation of these agents, addressing the common challenges that prevent many AI projects from reaching production. The tool utilizes large language models (LLMs) as "judges" to assess the reliability of task-specific agents, eliminating manual processes that are often slow, inconsistent, and difficult to scale. Jonathan Frankle, chief AI scientist of Databricks Inc., described Agent Bricks as a generalization of the best practices and techniques observed across various verticals, reflecting how Databricks believes agents should be built.

Agent Bricks originated from the need of Databricks' customers to effectively evaluate their AI agents. Ensuring reliability involves defining clear criteria and practices for comparing agent performance. According to Frankle, AI's inherent unpredictability makes LLM judges crucial for determining when an agent is functioning correctly. This requires ensuring that the LLM judge understands the intended purpose and measurement criteria, essentially aligning the LLM's judgment with that of a human judge. The goal is to create a scaled reinforcement learning system where judges can train an agent to behave as developers intend, reducing the reliance on manually labeled data.

Databricks' new features aim to simplify AI development by using AI to build agents and the pipelines that feed them. Fueled by user feedback, these features include a framework for automating agent building and a no-code interface for creating pipelines for applications. Kevin Petrie, an analyst at BARC U.S., noted that these announcements help Databricks users apply AI and GenAI applications to their proprietary data sets, thereby gaining a competitive advantage. Agent Bricks is currently in beta testing and helps users avoid the trap of "vibe coding" by forcing rigorous testing and evaluation until the model is extremely reliable.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • www.bigdatawire.com: Databricks Wants to Take the Pain Out of Building, Deploying AI Agents with Bricks
  • siliconangle.com: The best judge of artificial intelligence could be AI — at least that’s the idea behind Databricks Inc.’s new tool, Agent Bricks.
  • thenewstack.io: Databricks Launches Agent Bricks, Its New No-Code AI Agent Builder
  • www.infoworld.com: Databricks has released a beta version of a new agent building interface to help enterprises automate and optimize the agent building process.
  • thenewstack.io: Databricks Launches Agent Bricks, Its New No-Code AI Agent Builder
  • AI News | VentureBeat: Databricks Agent Bricks automates enterprise AI agent optimization and evaluation, eliminating manual processes that block production deployments.
  • SiliconANGLE: The best judge of artificial intelligence could be AI — at least that’s the idea behind Databricks Inc.’s new tool, Agent Bricks.
  • BigDATAwire: Databricks today launched Agent Bricks, a new offering aimed at helping customers AI agent systems up and running quickly, with the cost, safety, and efficiency they demand.
  • Analytics India Magazine: Databricks also launched MLflow 3.0, a redesigned version of its AI lifecycle management platform.
  • Verdict: Databricks introduces Agent Bricks for AI agent development
  • www.verdict.co.uk: Databricks introduces Agent Bricks for AI agent development
  • techstrong.ai: Highlights Databricks' simplified approach to building and training AI agents.
  • siliconangle.com: Reveals Databricks' play for AI agents and their data platform strategy.
  • techstrong.ai: Databricks this week launched a series of initiatives, including a beta release of an Agent Bricks framework that makes it simpler to create and modify artificial intelligence agents using techniques developed by Mosaic AI Research using multiple types of large language models (LLMs).
Classification:
Kuldeep Jha@Verdict //
Databricks has unveiled Agent Bricks, a no-code AI agent builder designed to streamline the development and deployment of enterprise AI agents. Built on Databricks’ Mosaic AI platform, Agent Bricks aims to address the challenge of AI agents failing to reach production due to slow, inconsistent, and difficult-to-scale manual evaluation processes. The platform allows users to request task-specific agents and then automatically generates a series of large language model (LLM) "judges" to assess the agent's reliability. This automation is intended to optimize and evaluate enterprise AI agents, reducing reliance on manual vibe tracking and improving confidence in production-ready deployments.

Agent Bricks incorporates research-backed innovations, including Test-time Adaptive Optimization (TAO), which enables AI tuning without labeled data. Additionally, the platform generates domain-specific synthetic data, creates task-aware benchmarks, and optimizes the balance between quality and cost without manual intervention. Jonathan Frankle, Chief AI Scientist of Databricks Inc., emphasized that Agent Bricks embodies the best engineering practices, styles, and techniques observed in successful agent development, reflecting Databricks' philosophical approach to building agents that are reliable and effective.

The development of Agent Bricks was driven by customer needs to evaluate their agents effectively. Frankle explained that AI's unpredictable nature necessitates LLM judges to evaluate agent performance against defined criteria and practices. Databricks has essentially created scaled reinforcement learning, where judges can train an agent to behave as desired by developers, reducing the reliance on labeled data. Hanlin Tang, Databricks’ Chief Technology Officer of Neural Networks, noted that Agent Bricks aims to give users the confidence to take their AI agents into production.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • SiliconANGLE: How Databricks’ Agent Bricks uses AI to judge AI
  • thenewstack.io: Databricks Launches Agent Bricks, Its New No-Code AI Agent Builder
  • techstrong.ai: Databricks Simplifies Building and Training of AI Agents
  • www.bigdatawire.com: Databricks Is Making a Long-Term Play to Fix AI’s Biggest Constraint
Classification:
Haden Pelletier@Towards Data Science //
Recent discussions in statistics highlight significant concepts and applications relevant to data science. A book review explores seminal ideas and controversies in the field, focusing on key papers and historical perspectives. The review mentions Fisher's 1922 paper, which is credited with creating modern mathematical statistics, and discusses debates around hypothesis testing and Bayesian analysis.

Stephen Senn's guest post addresses the concept of "relevant significance" in statistical testing, cautioning against misinterpreting statistical significance as proof of a genuine effect. Senn points out that rejecting a null hypothesis does not necessarily mean it is false, emphasizing the importance of careful interpretation of statistical results.

Furthermore, aspiring data scientists are advised to familiarize themselves with essential statistical concepts for job interviews. These include understanding p-values, Z-scores, and outlier detection methods. A p-value is crucial for hypothesis testing, and Z-scores help identify data points that deviate significantly from the mean. These concepts form a foundation for analyzing data and drawing meaningful conclusions in data science applications.

Share: bluesky twitterx--v2 facebook--v1 threads


References :
  • errorstatistics.com: Stephen Senn (guest post): “Relevant significance? Be careful what you wish for”
  • Towards Data Science: 5 Statistical Concepts You Need to Know Before Your Next Data Science Interview
  • Xi'an's Og: Seminal ideas and controversies in Statistics [book review]
  • medium.com: Statistics for Data Science and Machine Learning
  • medium.com: Why Data Science Needs Statistics
Classification: