Topic: model scaling

  • Small Language Models: AI21's Edge AI Breakthrough

    Small Language Models: AI21's Edge AI Breakthrough

    AI21's Jamba Reasoning 3B is an open-source, 3-billion-parameter model designed for high performance on consumer hardware, featuring a large 250,000-token context window for processing extensive documents and complex tasks efficiently. The model employs a hybrid architecture that blends transform...

    Read More »
  • Microsoft Builds In-House AI Models to Break Free from OpenAI

    Microsoft Builds In-House AI Models to Break Free from OpenAI

    Microsoft has developed two proprietary AI models, MAI-Voice-1 and MAI-1-preview, to diversify its technology and reduce reliance on external partners like OpenAI. MAI-Voice-1 is a speech-generation system for expressive audio, while MAI-1-preview is a large language model optimized for Copilot a...

    Read More »
  • Are Faulty Incentives Causing AI Hallucinations?

    Are Faulty Incentives Causing AI Hallucinations?

    Advanced language models like GPT-5 and ChatGPT persistently generate plausible but false statements, known as hallucinations, which are inherent and can be reduced but not fully eliminated. Hallucinations occur because models learn to predict text patterns without truth labels during pretraining...

    Read More »