Salesforce Tackles AI Reliability, Security for Enterprise Agents

@zdnet.com //

Salesforce Tackles AI Reliability, Security for Enterprise Agents

Salesforce is tackling the challenge of "jagged intelligence" in AI, aiming to enhance the reliability and consistency of enterprise AI agents. The company's AI Research division has introduced new benchmarks, models, and guardrails designed to make these agents more intelligent, trusted, and versatile for business applications. This initiative seeks to bridge the gap between an AI system's potential intelligence and its ability to perform consistently in unpredictable real-world enterprise environments. Salesforce is focusing on "Enterprise General Intelligence" (EGI), which prioritizes consistency alongside capability for AI agents in complex business settings.

Salesforce AI Research is addressing AI's inconsistency problem by introducing the SIMPLE dataset, a public benchmark with 225 reasoning questions to measure the "jaggedness" of AI systems. They have also introduced ContextualJudgeBench, which evaluates an agent’s ability to maintain accuracy and faithfulness in context-specific answers, emphasizing factual correctness and the ability to abstain from answering when appropriate, especially in sensitive fields like law, finance, and healthcare. These tools are essential for diagnosing and mitigating the erratic behavior of AI agents across tasks of similar complexity.

A recent Salesforce survey of 2,552 U.S. consumers reveals a growing acceptance of AI agents, with roughly half (53%) wanting AI to simplify complex information. Furthermore, Salesforce is expanding its Trust Layer with new safeguards, including the SFR-Guard model family, to detect prompt injections, toxic outputs, and hallucinations in both open-domain and CRM-specific data. Overall, the survey makes it clear that AI agents are already starting to have a societal impact.

Original img attribution: https://www.zdnet.com/a/img/resize/e28f4c8d948ec204ff4c11a7e0a2a122081f5582/2025/04/30/58dd235e-7ca5-46b2-ad12-baf09a7ce101/gettyimages-2205534708.jpg?auto=webp&fit=crop&height=675&width=1200

ImgSrc: www.zdnet.com

References :

venturebeat.com: Salesforce takes aim at â€˜jagged intelligenceâ€™ in push for more reliable AI
MarkTechPost: Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents
Salesforce: Salesforce AI Research Delivers New Benchmarks, Guardrails, and Models to Make Future Agents More Intelligent, Trusted, and Versatile
techstrong.ai: Reports on how surveys see individuals warming up to AI Agents.
www.marktechpost.com: Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents
www.salesforce.com: Salesforce AI Research Delivers New Benchmarks, Guardrails, and Models to Make Future Agents More Intelligent, Trusted, and Versatile
techstrong.ai: Salesforce Expands Enterprise General Intelligence Ambitions
Salesforce: Salesforce AI Research Delivers New Benchmarks, Guardrails, and Models to Make Future Agents More Intelligent, Trusted, and Versatile
techstrong.ai: Salesforce today expanded the scope of its artificial intelligence (AI) agents to handle more complex multifaceted tasks as part of an ongoing effort to enable enterprise general intelligence (EGI).

Classification:

HashTags: #TrustworthyAI #AgenticAI #AISecurity
Company: Salesforce
Target: Enterprises
Product: Agentforce
Feature: AI Benchmarks
Type: AI
Severity: Informative

News from the AI & ML world

DeeperML

Salesforce Tackles AI Reliability, Security for Enterprise Agents

Classification: