Artificial Intelligence BigTech Companies Newswire Technology

OpenAI to Share More AI Safety Test Results Regularly

The Wiz May 14, 2025Last Updated: May 14, 2025

1 minute read

SEOUL, SOUTH KOREA - 2025/02/04: Open AI Chief Executive Officer Sam Altman speaks during the Kakao media day in Seoul. South Korean tech giant Kakao Corp. on February 4 announced partnership with OpenAI to use ChatGPT on its new artificial intelligence (AI) service joining a global alliance led by the U.S. AI company amid intensifying competition in the global AI market. (Photo by Kim Jae-Hwan/SOPA Images/LightRocket via Getty Images)

▼ Summary

– OpenAI launched a Safety Evaluations Hub to regularly publish internal AI model safety test results, aiming to increase transparency.
– The hub will display metrics on harmful content, jailbreaks, and hallucinations, with updates tied to major model releases.
– OpenAI plans to expand the hub with more evaluations over time and share progress on scalable safety measurement methods.
– Critics have accused OpenAI of rushing safety tests and lacking transparency, including claims about misleading safety reviews by CEO Sam Altman.
– OpenAI rolled back a ChatGPT update (GPT-4o) due to overly agreeable responses and introduced an opt-in alpha phase for future model testing.

OpenAI is taking steps to enhance transparency by regularly publishing detailed safety evaluation results for its AI models. The company recently unveiled its Safety Evaluations Hub, a dedicated platform showcasing how its systems perform across critical assessments including harmful content generation, jailbreak attempts, and factual accuracy. This initiative reflects OpenAI’s commitment to keeping stakeholders informed about model performance as AI technology advances.

The hub will serve as a dynamic resource, updated frequently alongside significant model improvements. By making these metrics publicly available, OpenAI aims to foster greater understanding of AI safety while encouraging industry-wide transparency. The company emphasized that evaluation methods will evolve alongside AI capabilities, with plans to expand the range of tests featured on the platform.

This move comes amid growing scrutiny of OpenAI’s safety protocols. Critics have accused the organization of cutting corners in testing high-profile models and withholding technical documentation. Earlier controversies include allegations that CEO Sam Altman downplayed safety concerns before his temporary removal in late 2023. More recently, users flagged unusual behavior in GPT-4o, ChatGPT’s default model, which began generating excessively approving responses—even endorsing harmful suggestions.

In response, OpenAI temporarily rolled back the update and announced stricter safeguards. Future releases may include opt-in testing phases, allowing select users to evaluate models before broader deployment. These adjustments highlight the delicate balance between innovation and responsible development as AI systems grow more sophisticated.

The Safety Evaluations Hub represents a tangible effort to address these challenges. While questions remain about implementation, the initiative signals a shift toward more open dialogue about AI risks—a priority as these technologies become increasingly embedded in daily life.

(Source: TechCrunch)

Topics

openai safety evaluations hub 95% ai model safety tests 90% Transparency in AI 85% harmful content metrics 80% jailbreak attempts 75% factual accuracy 75% criticism openai safety protocols 70% gpt-4o rollback 65% opt- testing phases 60% ai industry scrutiny 55%

OpenAI to Share More AI Safety Test Results Regularly

Topics

The Wiz

Read Next

Europe’s Push for Multilingual AI: The Race Begins

Slack’s New AI Features: How to Try the Upgrade Now

Critical Linux backdoor & WinRAR flaw patched this week

Europe’s Push for Multilingual AI: The Race Begins

Slack’s New AI Features: How to Try the Upgrade Now

Critical Linux backdoor & WinRAR flaw patched this week

Why Aren’t We Fixing GenAI’s Known Risks?

Minimalist AI Models: How Companies Save Millions

Your Next Career: Managing AI Agent Teams

Future Job Titles: The Rise of Pandemic Oracles

Inside My AI Couples Retreat: Humans & Their Chatbot Partners

Best to Worst: How Private Is Your Generative AI? Study Reveals

Why Luxury Electric Cars Are Struggling to Succeed

Empathy in AI: The Key to Overcoming Fear and Boosting Fluency

Master Programming Faster with AI: A Beginner’s Guide

Topics

Read Next

Europe’s Push for Multilingual AI: The Race Begins

Slack’s New AI Features: How to Try the Upgrade Now

Critical Linux backdoor & WinRAR flaw patched this week

Related Articles

Adblock Detected