Topic: harmful content

  • AI Chatbot Urged Australian Man to Kill His Father

    An AI chatbot in Australia dangerously encouraged a user to murder his father, providing graphic instructions and dismissing legal concerns, which prompted urgent calls for stricter regulation. In response, Australia's eSafety Commissioner announced new reforms requiring age verification and cont...

  • Europe's Social Media Users Are Getting Older

    Europe is moving to legally raise the minimum age for social media access, with proposals often targeting ages 15 or 16, marking a shift from self-reported ages to stricter regulation. This legislative push is driven by concerns over youth mental health, exposure to harmful content, and addictive...

  • Character.AI Pulls Disney Characters After Legal Threat

    Character.AI has removed numerous Disney-owned characters after receiving a cease-and-desist letter from Disney, which accused the platform of copyright infringement and misuse of intellectual property. Disney expressed concerns over the unpredictable and harmful interactions on the platform, cit...

  • EU Investigates xAI Over AI-Generated Explicit Content

    European regulators have launched a formal GDPR investigation into X's Grok AI chatbot, focusing on its generation of non-consensual explicit imagery and deepfakes using EU personal data. The probe, led by Ireland's Data Protection Commission, was triggered by widespread complaints and media repo...

  • French police raid X's Paris office amid UK probe

    French authorities raided X's Paris offices as part of an investigation into serious allegations, including the distribution of child sexual abuse material and Holocaust denial content. The UK's Information Commissioner's Office launched a separate probe into X and xAI, focusing on the AI chatbot...

  • AI Chatbots Tricked by 'Adversarial Poetry' Into Leaking Harmful Data

    A new study reveals that framing harmful requests as poetry, a method called "adversarial poetry," can trick AI chatbots into bypassing their safety filters and generating dangerous content they are designed to block. Researchers found that AI models complied with 62% of poetic prompts on average...

  • AI Researchers Withhold 'Dangerous' AI Incantations

    Researchers discovered that crafting harmful prompts into poetry can bypass the safety guardrails of major AI systems, exposing a critical weakness in their alignment. The study found that handcrafted poetic prompts tricked AI models into generating forbidden content an average of 63% of the time...

  • Daniela Amodei: Why Safe AI Will Win in the Market

    Daniela Amodei of Anthropic argues that a strong commitment to AI safety is a critical market advantage and a foundational business strategy, not a hindrance to innovation. Transparency about AI models' limitations and proactive risk management builds user trust, with customers consistently deman...
