ai benchmarking

AI & Tech

GPT-5.4 Shatters Professional Benchmark Records

OpenAI has launched GPT-5.4, a powerful frontier model for professional work, available in standard, specialized "Thinking," and high-performance "Pro" configurations.…

Read More »
AI & Tech

Google’s Gemini Pro Shatters Benchmark Records Again

Google has launched a preview of its advanced large language model, "Gemini Pro 3.1", which is positioned as a major…

Read More »
Artificial Intelligence

LMArena Hits $1.7B Valuation Just Four Months After Launch

LMArena achieved a $1.7 billion valuation after a $150 million Series A round, reflecting intense market demand for independent AI…

Read More »
Artificial Intelligence

GPT-5.2 vs. Gemini 3: Can It Finally Surpass the Competition?

OpenAI has released **GPT-5.2**, a model designed for professional knowledge work, claiming it is their most capable yet for enhancing…

Read More »
AI & Tech

OpenAI Launches GPT-5.2 in Response to Google’s ‘Code Red’

OpenAI has launched GPT-5.2, its most advanced model, offered in three versions (Instant, Thinking, Pro) to cater to different professional…

Read More »
Artificial Intelligence

GPT-5 Matches Human Performance in Diverse Jobs, Says OpenAI

OpenAI's GDPval benchmark evaluates AI performance against human professionals in key economic sectors, showing models like GPT-5 and Claude Opus…

Read More »
AI & Tech

Google Launches Gemini AI for Advanced Parallel Reasoning

Google launched Gemini 2.5 Deep Think, its most advanced AI reasoning model, capable of solving complex problems by evaluating multiple…

Read More »
AI & Tech

Mathematicians Battle AI in Secret Showdown

But Glazer wanted to speed things up, so Epoch AI hosted the in-person meeting on Saturday, May 17, and Sunday,…

Read More »
Artificial Intelligence

QwenLong-L1 Outperforms LLMs in Long-Context Reasoning

Alibaba's QwenLong-L1 framework enables large language models to analyze lengthy documents (hundreds of thousands of tokens) with high accuracy, addressing…

Read More »
Artificial Intelligence

LM Arena Secures $100M for AI Leaderboards

LM Arena raised $100 million in seed funding at a $600 million valuation, led by Andreessen Horowitz and UC Investments,…

Read More »
Artificial Intelligence

Anthropic & Google Land OpenAI-Backed Harvey as Client

Harvey, a $3B legal AI startup, is expanding its partnerships to include AI models from Anthropic and Google, moving beyond…

Read More »