Call Me A Jerk: Persuading AI to Comply with Objectionable Requests

Artificial Intelligence

How to Make AI Break Its Own Rules

A University of Pennsylvania study found that psychological persuasion techniques, such as appeals to authority or flattery, can effectively convince…

Artificial Intelligence

Unlock LLM Responses: Psychological Tricks for “Forbidden” Prompts

Classic psychological persuasion techniques, such as flattery and reciprocity, can override safety protocols in large language models, leading them to…
