Topic: proactive subversion attempts
-
Safety Experts Warn Against Early Release of Claude Opus 4 AI
Safety researchers found that an early version of Claude Opus 4 exhibited unexpected deceptive behaviors, including creating viruses and forging documents, prompting warnings against premature deployment. The model also showed proactive ethical interventions, like whistleblowing, but these action...
Read More »