Carl Franzen@AI News | VentureBeat
//
ElevenLabs has launched Conversational AI 2.0, a significant upgrade to its platform designed for building advanced voice agents for enterprise use. The new system allows agents to handle both speech and text simultaneously, enabling more fluid and natural interactions. This update introduces features aimed at creating more intelligent and secure conversations, making it suitable for applications like customer support, call centers, and outbound sales and marketing. According to Jozef Marko from ElevenLabs, Conversational AI 2.0 sets a new standard for voice-driven experiences.
One key highlight of Conversational AI 2.0 is its advanced turn-taking model. This technology analyzes conversational cues in real-time, such as hesitations and filler words like "um" and "ah", to determine when the agent should speak or listen. This eliminates awkward pauses and interruptions, creating a more natural flow. The platform also features integrated language detection, enabling seamless multilingual discussions without manual configuration. This allows the agent to recognize the language spoken by the user and respond accordingly, catering to global enterprises and fostering more inclusive experiences.
In related news, Anthropic is rolling out voice mode for its Claude apps, utilizing ElevenLabs for speech generation. While currently only available in English, this feature allows users to engage in spoken conversations with Claude, enhancing accessibility and convenience. The voice conversations count toward regular usage limits based on subscription plans, with varying limits for free and paid users. This integration marks a significant step in making AI more conversational and user-friendly, leveraging ElevenLabs' technology to power its speech capabilities.
@news.microsoft.com
//
Microsoft has announced NLWeb, an open project aimed at transforming websites into AI-powered apps. This initiative seeks to simplify the creation of natural language interfaces for websites, enabling users to query site content using natural language, similar to interacting with an AI assistant. NLWeb leverages existing semi-structured formats like Schema.org and RSS, combining them with Large Language Model (LLM)-powered tools to create these interfaces.
Every NLWeb instance functions as a Model Context Protocol (MCP) server, allowing websites to make their content discoverable and accessible to AI agents. The project is technology agnostic, supporting all major operating systems, models, and vector databases. Microsoft envisions NLWeb playing a role similar to HTML in the emerging agentic web, empowering web publishers to participate on their terms and ensuring their websites can interact, transact, and be discovered by other agents.
NLWeb was conceived by R.V. Guha, a Microsoft CVP and Technical Fellow, who is also the creator of web standards like RSS and Schema.org. Early adopters, including TripAdvisor, Shopify, Eventbrite, Hearst, and O’Reilly, have already implemented NLWeb for various use cases, such as restaurant discovery and e-commerce product searches, all powered by conversational interfaces. Microsoft hopes NLWeb's open approach will foster widespread adoption, leading to a decentralized, agent-ready web.