Artificial Intelligence Business Cybersecurity Newswire

AI Agents Face Challenges in CRM and Data Confidentiality

The Wiz June 17, 2025Last Updated: June 17, 2025

2 minutes read

A team of businesspeople collaborate to assemble a large, complex robot against a futuristic digital backdrop.

▼ Summary

– LLM agents have a 58% success rate on single-step CRM tasks but only 35% on multi-step tasks, per a Salesforce-led study.
– AI agents perform well in “Workflow Execution” (83% success rate) but struggle with proactive information gathering in multi-turn tasks.
– LLMs lack built-in confidentiality awareness, often mishandling sensitive data unless prompted, which reduces task effectiveness.
– Open-source models perform worst at confidentiality due to difficulty following complex safety instructions in extended conversations.
– The study warns against using LLM agents for high-stakes, data-heavy CRM tasks without improved reasoning and safety protocols.

AI-powered customer relationship management tools still face significant hurdles when handling complex tasks and sensitive data, according to recent research. A comprehensive evaluation of large language model agents revealed notable limitations in their ability to manage multi-step processes and maintain data confidentiality.

The study found these digital assistants performed reasonably well on straightforward, single-action requests, achieving a 58% success rate for basic tasks. However, their effectiveness plummeted to just 35% when facing multi-step operations requiring sequential decision-making or follow-up actions. The systems particularly struggled with proactively gathering unspecified information through clarification dialogues – a critical capability for meaningful customer interactions.

Where these AI agents showed promise was in workflow execution, with top performers reaching 83% accuracy for single-interaction tasks. Performance improved noticeably when the systems actively sought clarification during complex exchanges, suggesting that information-gathering protocols significantly impact outcomes. This finding highlights both the potential and current limitations of automated systems in managing nuanced customer conversations that demand adaptive responses.

Data security emerged as perhaps the most pressing concern. Most language models demonstrated alarmingly low awareness of confidentiality requirements, often failing to recognize sensitive information without explicit programming. While targeted prompts could improve privacy handling, these modifications frequently degraded overall performance – and their effectiveness diminished as conversations progressed.

Open-source models proved especially vulnerable to confidentiality lapses, likely due to challenges in processing layered instructions. For businesses handling personally identifiable information, proprietary data, or client confidentiality agreements, these findings raise serious concerns about deploying current AI solutions without robust safeguards.

The research underscores that while AI agents show promise for certain CRM applications, they aren’t yet reliable for high-stakes data processing or complex customer interactions. Significant advancements in reasoning capabilities, safety protocols, and adaptive learning will be necessary before these tools can safely handle sensitive or multi-faceted business processes. Organizations considering AI implementations must carefully evaluate these limitations against their specific needs and risk tolerance.

(Source: MarTech)