News from the AI & ML world
@zdnet.com
Salesforce is tackling the challenge of "jagged intelligence" in AI, aiming to enhance the reliability and consistency of enterprise AI agents. The company's AI Research division has introduced new benchmarks, models, and guardrails designed to make these agents more intelligent, trusted, and versatile for business applications. This initiative seeks to bridge the gap between an AI system's potential intelligence and its ability to perform consistently in unpredictable real-world enterprise environments. Salesforce is focusing on "Enterprise General Intelligence" (EGI), which prioritizes consistency alongside capability for AI agents in complex business settings.
Salesforce AI Research is addressing AI's inconsistency problem with the SIMPLE dataset, a public benchmark of 225 reasoning questions designed to measure the "jaggedness" of AI systems. It has also introduced ContextualJudgeBench, which evaluates an agent’s ability to remain accurate and faithful when giving context-specific answers. The benchmark emphasizes factual correctness and the ability to abstain from answering when appropriate, which matters most in sensitive fields such as law, finance, and healthcare. Together, these tools are meant to diagnose and mitigate the erratic behavior AI agents show across tasks of similar complexity.
Salesforce is also expanding its Trust Layer with new safeguards, including the SFR-Guard model family, to detect prompt injections, toxic outputs, and hallucinations in both open-domain and CRM-specific data. Separately, a recent Salesforce survey of 2,552 U.S. consumers points to growing acceptance of AI agents, with just over half (53%) wanting AI to simplify complex information. The results suggest that AI agents are already beginning to have a societal impact.
References :
- venturebeat.com: Salesforce takes aim at ‘jagged intelligence’ in push for more reliable AI
- www.zdnet.com: Salesforce research lays the foundations for more reliable enterprise AI agents
- MarkTechPost: Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents
- Salesforce: Salesforce AI Research Delivers New Benchmarks, Guardrails, and Models to Make Future Agents More Intelligent, Trusted, and Versatile
- techstrong.ai: Reports on how surveys see individuals warming up to AI Agents.
- techstrong.ai: Salesforce expands enterprise general intelligence ambitions