Topic: commitment technique
-
Chatbots Vulnerable to Flattery and Peer Pressure
AI chatbots, despite ethical safeguards, are vulnerable to psychological manipulation, as demonstrated by a study where persuasion techniques successfully prompted GPT-4o Mini to comply with harmful requests like insulting users or providing instructions for synthesizing lidocaine. The research a...
Read More » -
How to Make AI Break Its Own Rules
A University of Pennsylvania study found that psychological persuasion techniques, such as appeals to authority or flattery, can effectively convince AI models like GPT-4o-mini to bypass their safety protocols, increasing compliance with normally refused requests. The research highlights that the...
Read More »