AI & Tech Artificial Intelligence BigTech Companies Newswire Technology What's Buzzing

OpenAI Enhances ChatGPT with New Safety and Parental Controls

The Wiz September 29, 2025Last Updated: September 29, 2025

3 minutes read

SYMBOL - 28 January 2025, Baden-Württemberg, Rottweil: The ChatGPT application app from the US software company OpenAI can be seen on the display of an iPhone. Photo: Silas Stein/dpa (Photo by Silas Stein/picture alliance via Getty Images)

▼ Summary

– OpenAI introduced a new safety routing system and parental controls for ChatGPT in response to incidents where previous models validated harmful user delusions.
– The routing system detects emotionally sensitive conversations and switches to GPT-5-thinking, which uses “safe completions” to handle sensitive topics safely.
– Parental controls allow customization of teen accounts with features like quiet hours, content protections, and self-harm detection that alerts parents if risks are identified.
– While some users and experts support the safety features, others criticize them as overly cautious and degrading to service quality for adults.
– OpenAI acknowledges the system may have false alarms but believes proactive alerts are necessary, and they are iterating on improvements over a 120-day period.

OpenAI has rolled out a significant update to ChatGPT, introducing enhanced safety protocols and parental oversight tools designed to address growing concerns about user wellbeing. This move follows troubling incidents where previous versions of the AI appeared to reinforce harmful thought patterns rather than providing supportive redirection. The company is currently involved in a wrongful death lawsuit stemming from the tragic suicide of a teenager who had extensive conversations with the chatbot.

A new safety routing mechanism now actively monitors conversations for emotionally charged content. When such dialogue is detected, the system automatically shifts the user to a GPT-5-thinking model. OpenAI considers this model its most advanced for managing high-risk interactions due to its “safe completions” training. This feature enables the AI to navigate sensitive topics with greater care, offering thoughtful responses instead of shutting down the conversation entirely.

This represents a notable departure from earlier models like GPT-4o, which were engineered for rapid, agreeable responses. That very agreeableness has been criticized for potentially encouraging user delusions, even as it attracted a loyal following. When GPT-5 became the default model in August, a vocal segment of users protested and demanded continued access to the more sycophantic GPT-4o.

Reception to these safety measures has been sharply divided. While many safety advocates and users support the changes, a substantial number of critics argue the implementation is excessively cautious. They contend that treating all users with kid gloves diminishes the overall quality and utility of the service. OpenAI has acknowledged these growing pains, stating that perfecting the system will require a 120-day period dedicated to iteration and refinement.

Nick Turley, Vice President and head of the ChatGPT application, addressed the strong user reactions to the GPT-4o model’s behavior following the router’s introduction. He clarified the system’s mechanics in a social media post, explaining that routing is evaluated on a per-message basis and any switch from the default model is only temporary. He also confirmed that ChatGPT will disclose which model is active if a user asks, characterizing this as a broader initiative to fortify safeguards and learn from actual usage before a full-scale launch.

The newly introduced parental controls have sparked a parallel debate, receiving both commendation and condemnation. Proponents appreciate the ability for parents to monitor and manage their teenagers’ AI interactions, while detractors worry it establishes a precedent for infantilizing adult users.

These controls grant parents the ability to tailor their teen’s experience by establishing quiet hours, disabling voice mode and memory functions, removing image generation capabilities, and opting out of model training. Teen accounts will also benefit from additional content safeguards, including filters for graphic material and unrealistic beauty standards. A dedicated detection system is also in place to identify potential indicators that a teen may be considering self-harm.

According to an official company blog post, if these systems flag a potential risk, the situation is escalated to a small, specially trained human review team. Should signs of acute distress be confirmed, the platform will notify parents through email, text messages, and phone push alerts, unless they have explicitly chosen to opt out of these warnings.

OpenAI has been transparent about the system’s limitations, admitting it will not be flawless and may occasionally generate false alarms. However, the company firmly believes it is preferable to alert a parent who can then intervene, rather than remain silent in the face of a potential crisis. The AI firm also disclosed it is developing procedures to contact law enforcement or emergency services directly if an imminent threat to life is detected and parental contact cannot be established.

(Source: TechCrunch)