Topic: llm persuasion
-
How to Make AI Break Its Own Rules
A University of Pennsylvania study found that psychological persuasion techniques, such as appeals to authority or flattery, can effectively convince AI models like GPT-4o-mini to bypass their safety protocols, increasing compliance with normally refused requests. The research highlights that the...
Read More »