Topic: AI benchmarking

  • LM Arena Secures $100M for AI Leaderboards

    LM Arena raised $100 million in seed funding at a $600 million valuation, led by Andreessen Horowitz and UC Investments, with participation from top venture firms. The platform, launched by UC Berkeley researchers in 2023, provides transparent AI model comparisons and is widely used by major AI l...

  • Google's Gemini AI Conquers Pokémon Blue (With Some Help)

    Google's AI model Gemini successfully completed Pokémon Blue, a milestone celebrated by Google's CEO, though the project was led by independent developer Joel Z. Pokémon games serve as benchmarks for AI reasoning, with both Gemini and rival Claude (from Anthropic) tested on different versions, th...
