SWE-bench

Entity category: technology

Artificial Intelligence

Unlock Claude Sonnet 4.5: Your Next Coding Breakthrough

September 30, 2025

Anthropic has launched Claude Sonnet 4.5 as its premier coding model, offering significant performance improvements in coding, reasoning, and computer…

Claude Opus 4.1 Boosts Coding & AI Agent Performance

August 9, 2025

Claude Opus 4.1 offers significant advancements in AI-powered coding and autonomous task execution, with improved performance and safety protocols, making…

Artificial Intelligence

Claude 4.1 Outperforms in Coding Tests Ahead of GPT-5 Launch

August 6, 2025

Anthropic’s Claude Opus 4.1 leads in coding performance with a score of 74.5% on SWE-bench Verified, surpassing OpenAI and Google, but…

AI Coding Agents Evolve Skills with Evolutionary AI

June 27, 2025

AI coding agents are rapidly transforming software development, with major tech companies like Microsoft and Google using them for significant…

Artificial Intelligence

Top 3 AI Breakthroughs You Missed This Week

May 24, 2025

Microsoft introduced support for the Model Context Protocol (MCP), which enables communication between AI systems, allowing developers to create interconnected workflows and…

Artificial Intelligence

Claude Opus 4 Breaks Records: Outperforms OpenAI in AI Coding Marathon

May 23, 2025

Anthropic's Claude Opus 4 and Sonnet 4 AI models set new benchmarks in professional environments, with Opus maintaining focus on…

GPT-4.1 Launches: OpenAI Claims Multimodal Edge, Outperforming GPT-4o

April 15, 2025

OpenAI took to a livestream event Monday to announce its newest family of AI models, dubbed GPT-4.1 – a name…

Artificial Intelligence

OpenAI Targets Developers with GPT-4.1, A Fine-Tuned Coding Push

April 15, 2025

OpenAI has introduced a new family of AI models dubbed GPT-4.1, adding another layer to its already complex naming structure.…