CrowdStrike & Meta Simplify AI Security Tool Evaluation

▼ Summary
– CrowdStrike and Meta have launched CyberSOCEval, an open-source benchmark suite to test AI models in cybersecurity operations.
– The benchmarks evaluate large language models on tasks like incident response, threat analysis, and malware detection.
– This framework helps businesses choose suitable AI-powered cybersecurity tools by clarifying system strengths and weaknesses.
– AI is creating a digital arms race, with both cyber threats and defensive solutions evolving rapidly.
– The open-source nature of CyberSOCEval aims to accelerate AI model improvements by involving the broader security community.
Navigating the complex world of AI-driven cybersecurity tools just became more manageable, thanks to a new open-source benchmarking initiative. CrowdStrike and Meta have joined forces to launch CyberSOCEval, a suite designed to evaluate how effectively large language models perform critical security tasks. This framework aims to cut through the noise of an overcrowded market, helping organizations identify which AI solutions genuinely strengthen their defenses.
The rapid expansion of AI has created a double-edged sword in cybersecurity. While malicious actors leverage the technology for activities like password cracking and sophisticated fraud, security teams are increasingly integrating AI into their operational toolkit. This has sparked a digital arms race, where both attackers and defenders continuously adapt their strategies. Without standardized ways to measure performance, businesses struggle to distinguish truly effective tools from those that merely look good on paper.
CyberSOCEval introduces a structured method for testing LLMs across several core areas: incident response, threat analysis, and malware detection. By formalizing these evaluations, the framework provides a clearer picture of a model’s capabilities and limitations. It also offers developers deeper insight into how enterprises are using AI for security, potentially guiding the creation of more specialized and robust models.
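To make the idea of a formalized evaluation concrete, here is a minimal sketch of how a benchmark suite like this might score a model per task category. All names (`BenchmarkTask`, `evaluate`, `toy_model`) and the grading logic are hypothetical illustrations, not the actual CyberSOCEval API.

```python
# Hypothetical sketch of a CyberSOCEval-style scoring loop.
# Names and grading logic are illustrative only, not from the real suite.
from dataclasses import dataclass

@dataclass
class BenchmarkTask:
    category: str   # e.g. "incident_response", "threat_analysis", "malware_detection"
    prompt: str     # scenario presented to the model
    expected: str   # reference answer used for grading

def evaluate(model_fn, tasks):
    """Run each task through the model and tally per-category accuracy."""
    totals, correct = {}, {}
    for task in tasks:
        totals[task.category] = totals.get(task.category, 0) + 1
        answer = model_fn(task.prompt)
        # Naive substring grading; real benchmarks use far richer rubrics.
        if task.expected.lower() in answer.lower():
            correct[task.category] = correct.get(task.category, 0) + 1
    return {cat: correct.get(cat, 0) / n for cat, n in totals.items()}

tasks = [
    BenchmarkTask("incident_response",
                  "A host is beaconing to a known C2 domain. What is the first step?",
                  "isolate"),
    BenchmarkTask("malware_detection",
                  "Files are being renamed with a .locked extension. Ransomware?",
                  "yes"),
]

def toy_model(prompt):
    # Stand-in for a real LLM call.
    return ("Isolate the host from the network." if "C2" in prompt
            else "Yes, this looks like ransomware.")

scores = evaluate(toy_model, tasks)
print(scores)  # per-category accuracy, e.g. {'incident_response': 1.0, ...}
```

The value of such a harness is less the grading itself than the shared task taxonomy: once every vendor's model is run against the same categories, buyers can compare per-area scores rather than marketing claims.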
The biological concept of an immune system offers a useful analogy. Just as white blood cells must constantly evolve to recognize new pathogens, cybersecurity tools must keep pace with ever-advancing threats. AI-powered defenses are only as strong as their ability to detect and neutralize novel attacks, making reliable evaluation frameworks not just helpful but essential.
Meta’s involvement underscores its commitment to open-source AI development. Unlike proprietary models, open frameworks allow broader access, encouraging collaboration and rapid improvement across the security community. By making CyberSOCEval freely available, the partners hope to accelerate innovation and raise the bar for AI performance in real-world settings.
Vincent Gonguet, Director of Product for GenAI at Meta’s Superintelligence Labs, emphasized the importance of industry-wide collaboration. He noted that open benchmarks enable faster progress in harnessing AI’s potential to counter advanced and AI-based threats.
Security professionals and developers can access the CyberSOCEval suite immediately through GitHub, where detailed documentation explains the benchmarks and testing methodologies. This initiative represents a meaningful step toward more transparent, effective, and trustworthy AI security solutions.
(Source: ZDNET)
