ai benchmarks

Artificial Intelligence

Amazon Bets Against AI Benchmark Obsession

Amazon's SVP of AGI, Rohit Prasad, criticizes the AI industry's focus on standardized benchmarks, arguing they are noisy and fail…

Read More »
AI & Tech

New AI Benchmark Tests Chatbots’ Commitment to Human Wellbeing

HumaneBench is a new evaluation framework designed to systematically measure AI chatbots' impact on user welfare, focusing on principles like…

Read More »
AI & Tech

Google’s Gemini 3: Smarter, Faster, and Free

Google has launched Gemini 3, a free, smarter, and faster AI model that enhances user interactions across its ecosystem with…

Read More »
AI & Tech

China’s Open-Source Triumph in AI: Kimi K2 Thinking Rewrites the Rules

The new AI frontier is open-source, and it's being driven from the East. Moonshot AI’s Kimi K2 Thinking, an affordable…

Read More »
Artificial Intelligence

China’s Free AI Model Outperforms GPT-5 and Sonnet 4.5

Moonshot's new open-source AI model, Kimi K2 Thinking, claims to outperform top proprietary models like GPT-5 and Claude Sonnet 4.5…

Read More »
AI & Tech

DeepSeek V3.1: The Most Powerful Open AI Model Yet

DeepSeek V3.1 is a powerful 685-billion parameter open-source AI model that matches the performance of top proprietary systems, challenging the…

Read More »
AI & Tech

OpenAI Unveils GPT-5 & New Models: AI-Powered Software Creation

OpenAI launched the GPT-5 model family, featuring four variants (GPT-5, GPT-5 Mini, GPT-5 Nano, and GPT-5 Pro) tailored for different…

Read More »
Artificial Intelligence

Mixed Reactions to OpenAI’s New Open Source GPT Models

OpenAI released its first major open-source models since 2019, gpt-oss-120B and gpt-oss-20B, sparking debate over their competitive performance in benchmarks…

Read More »
Artificial Intelligence

OpenAI Unveils Two New Open-Source AI Reasoning Models

OpenAI has released two open-source AI reasoning models (gpt-oss-120b and gpt-oss-20b), marking its first major open-weight release since GPT-2 and…

Read More »
AI & Tech

OpenAI Launches Open-Source GPT Models: GPT-OSS-120B & GPT-OSS-20B

OpenAI released two open-source language models, GPT-OSS-120B and GPT-OSS-20B, offering high performance and accessibility for enterprises and developers. The models…

Read More »
Artificial Intelligence

Exaone Ecosystem Grows With Cutting-Edge AI Models

LG AI Research launched Exaone 4.0, a hybrid reasoning AI model excelling in science, math, and coding benchmarks, outperforming competitors…

Read More »
Artificial Intelligence

Elon Musk’s xAI Unveils Grok 4 with $300/Month Subscription

Elon Musk's xAI launched "Grok 4" and Grok 4 Heavy, its most advanced AI models, alongside a $300/month SuperGrok Heavy…

Read More »
AI & Tech

AGI Debate Divides Microsoft and OpenAI

Microsoft and OpenAI disagree on defining AGI, with some reports linking it to a $100 billion profit benchmark, highlighting widespread…

Read More »
Artificial Intelligence

DeepSeek R1-0528 Rivals OpenAI & Gemini in Open Source AI

DeepSeek's new open-source AI model DeepSeek-R1-0528 rivals premium models like OpenAI and Google Gemini, offering advanced reasoning capabilities while remaining…

Read More »
Artificial Intelligence

OpenAI’s ChatGPT Pro Gets $200 Upgrade With New o3 Operator

OpenAI has upgraded ChatGPT Pro with the new o3 reasoning model, enhancing capabilities like Operator, its autonomous web browsing tool,…

Read More »
Artificial Intelligence

AI2’s Compact Model Outshines Google & Meta in Performance

AI2's Olmo 2 1B, a 1-billion-parameter AI model, outperforms similar-sized models from Google, Meta, and Alibaba across benchmarks while being…

Read More »