GPT-5 performance

GPT-5 Fails Over 50% of Real-World Orchestration Tasks in MCP-Universe Benchmark

August 23, 2025

Salesforce AI Research has introduced MCP-Universe, an open-source benchmark that evaluates large language models' performance in real-world enterprise scenarios, focusing…

AI & Tech

GPT-5 vs. GPT-4o: Which AI Performs Better?

August 15, 2025

Users have criticized OpenAI's GPT-5 for its clinical tone, reduced creativity, and misleading responses, leading OpenAI to reintroduce GPT-4o as…

AI & Tech

GPT-5 Rollout Faces Challenges, Says OpenAI

August 9, 2025

OpenAI's GPT-5 launch faced unexpected performance issues, struggling with basic math and coding tasks, contradicting its benchmark claims. Users and…

Artificial Intelligence

OpenAI accused of sentiment analysis controversy

August 8, 2025

OpenAI faced criticism for misleading performance charts during its GPT-5 showcase, with inconsistencies in data representation across multiple graphs. CEO…