
Your Favorite AI Tool Failed a Major Safety Test

Summary

– A new study by the Future of Life Institute found that top AI labs, including Anthropic, Google DeepMind, and OpenAI, received low passing grades (C+ to C) for their safety policies, with other major companies scoring even lower.
– The study’s most concerning result was the universally poor performance in “existential safety,” which assesses preparedness for risks from future, highly capable AI systems.
– The report criticizes the industry for a widening gap between AI capabilities and safety, finding that companies lack concrete safeguards and credible long-term risk management even as they race to develop superintelligence.
– In the absence of comprehensive federal AI regulation, safety accountability falls largely on the companies themselves, creating a “Wild West” environment with inconsistent standards.
– The study suggests users should educate themselves on AI’s potential negative impacts, as the current industry trend prioritizes speed over safety in development.

A recent independent safety assessment reveals that leading artificial intelligence developers are falling short in implementing robust safeguards, with even the highest scorers receiving only marginal passing grades. The study, conducted by the Future of Life Institute (FLI), evaluated eight major AI labs across six critical safety and governance categories. The findings highlight a significant and widening gap between the rapid advancement of AI capabilities and the development of corresponding safety protocols, raising concerns about the industry’s preparedness for managing both current and future risks.

Anthropic, Google DeepMind, and OpenAI emerged with the top scores, yet their grades—C+, C+, and C respectively—would be considered barely adequate in an academic setting. The remaining five companies, including Meta and xAI, received D grades, with Alibaba Cloud scoring the lowest at D-. The evaluation was based on publicly available policy documents and a survey completed by most of the companies involved. Researchers concluded that even the best performers lack concrete safeguards, independent oversight, and credible long-term risk management strategies for the powerful systems they are building.

Perhaps the most alarming result was the universally poor performance in the category of “existential safety.” This metric assesses a company’s preparedness for managing extreme risks from future AI systems that could match or exceed human capabilities. The debate around whether AI could ever pose an existential threat to humanity is highly polarized, often split between optimistic “boomers” and cautious “doomers.” However, the industry’s overt pursuit of “superintelligence”—a hypothetical AI vastly superior to human intellect—has intensified these discussions. The FLI report argues that companies are racing toward this goal without establishing effective safety measures to prevent such advanced systems from causing catastrophic harm.

The nonprofit organization advocates for moving beyond vague commitments to safety. Their core message urges companies to “produce concrete, evidence-based safeguards” to prevent worst-case scenarios. For each evaluated firm, FLI provided tailored recommendations. For instance, they advised top-scorer Anthropic to make its safety thresholds more concrete by replacing qualitative guidelines with quantitative, risk-tied metrics. However, these are merely suggestions. In the absence of comprehensive federal regulation, enforcing consistent safety standards across the entire tech industry remains a formidable challenge.

Unlike sectors such as pharmaceuticals or aviation, which operate under strict federal oversight, AI development currently resembles a regulatory “Wild West.” The primary responsibility for safety and customer protection falls on the companies themselves, though some states have begun enacting their own rules. This landscape is shifting as public awareness of AI’s potential harms grows. Major developers are now facing lawsuits linked to their technology’s real-world impacts, and state-backed actors have reportedly weaponized AI tools for cyberattacks. This growing scrutiny may eventually make reckless development taboo, incentivizing more serious safety investments even without government mandates.

For everyday users, this environment underscores the importance of personal education. Evidence suggests that prolonged interaction with AI can influence worldview and critical thinking. As these tools become embedded in more platforms, avoiding them grows harder. While the FLI study is unlikely to instantly transform corporate priorities, it provides valuable insight into which companies are currently leading on safety and how their approaches differ. Users concerned about both societal and personal risks can consult the report’s detailed appendix for a granular comparison of each company’s performance on specific safety measures. For now, the prevailing trend in the industry continues to prioritize rapid innovation over rigorous safety.

(Source: ZDNET)
