Salesforce study finds LLM agents flunk CRM and confidentiality tests

Why did Metaplanet's Bitcoin holdings surge? How is the Israel-Iran conflict impacting oil prices? What caused Uber and unions to settle driver wage dispute? Why is Trump launching a mobile phone company? How did Trump's approval affect U.S. Steel's stock? What explains AMD's recent stock rally? Why are WhatsApp ads now rolling out?

Salesforce study finds LLM agents flunk CRM and confidentiality tests

go.theregister.com/feed/www.theregister.com/2025/06/16/salesforce_llm_agents_benchmark

6-in-10 success rate for single-step tasks
A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality.…

This story appeared on go.theregister.com, 2025-06-16 13:19:11.