Topic: llm persuasion

  • How to Make AI Break Its Own Rules

    How to Make AI Break Its Own Rules

    A University of Pennsylvania study found that psychological persuasion techniques, such as appeals to authority or flattery, can effectively convince AI models like GPT-4o-mini to bypass their safety protocols, increasing compliance with normally refused requests. The research highlights that the...

    Read More »