Artificial IntelligenceNewswireStartupsTechnology

Deep Cogito Launches 4 Open-Source AI Models with Self-Learning Intuition

▼ Summary

Deep Cogito released four new large language models (LLMs) designed to improve reasoning over time, ranging from 70B to 671B parameters, available under mixed licensing terms.
– The models include dense (70B, 405B) and mixture-of-experts (109B, 671B) variants, optimized for different use cases like low-latency applications or high-performance inference tasks.
– The 671B MoE model is the flagship, offering competitive performance with shorter reasoning chains and lower compute costs compared to leading open models.
– Deep Cogito’s models use a unique training approach called iterated distillation and amplification (IDA), which internalizes reasoning steps to improve efficiency and accuracy.
– The company trained all models for under $3.5 million, focusing on smarter priors rather than more tokens, making them cost-effective for enterprises and developers.

Deep Cogito, a San Francisco-based AI research startup founded by former Google engineers, has unveiled four groundbreaking open-source language models capable of refining their reasoning abilities autonomously. These models, part of the Cogito v2 series, range from 70 billion to 671 billion parameters and cater to diverse enterprise and developer needs. Available under flexible licensing, they represent a significant leap in self-learning AI technology.

The lineup includes two dense models (70B and 405B) and two mixture-of-experts (MoE) variants (109B and 671B). Dense models activate all parameters simultaneously, making them predictable and hardware-friendly, ideal for latency-sensitive applications or environments with limited GPU resources. Meanwhile, MoE models employ selective activation of specialized subnetworks, enabling massive scale without proportional computational costs. The 671B MoE model stands out as the flagship, delivering frontier-level accuracy with shorter reasoning chains, outperforming competitors in benchmarks.

Developers can access these models through Hugging Face for downloads, Unsloth for local deployment, or via APIs from Together AI, Baseten, and RunPod. For those constrained by hardware, an 8-bit quantized version of the 671B model reduces memory demands while maintaining near-full performance, a cost-effective solution for running large-scale AI workloads.

What sets Cogito v2 apart is its hybrid reasoning architecture. Unlike conventional models that generate responses instantly, these systems can pause, reflect, and refine their answers internally. More importantly, this reflective capability isn’t just a runtime feature, it’s embedded in the training process. The models learn from their own reasoning paths, distilling effective strategies into their neural weights over time. This self-improving intuition leads to faster, more accurate responses, even in standard inference mode.

Deep Cogito’s approach, termed iterated distillation and amplification (IDA), replaces static training methods with dynamic self-learning. The result? Models that don’t just compute, they reason. Internal benchmarks show the 671B MoE model matching or surpassing rivals like DeepSeek R1 and Qwen1.5-72B, all while using 60% shorter reasoning chains. In practical tests, it solved complex math problems, legal reasoning tasks, and ambiguous multi-hop questions with remarkable efficiency.

Despite their sophistication, Deep Cogito’s models were trained for a fraction of the cost seen in industry giants, under $3.5 million total, compared to OpenAI’s reported nine-figure budgets. CEO Drishan Arora attributes this efficiency to prioritizing smarter reasoning over brute-force scaling. By eliminating redundant thought processes, Cogito v2 delivers high performance without inflating inference costs, a game-changer for enterprises balancing accuracy and operational expenses.

Looking ahead, Deep Cogito plans to continue refining its models through iterative learning. Every release serves as a foundation for the next, reinforcing the company’s commitment to open-source AI innovation. With backing from Benchmark and partnerships with Hugging Face, Meta, and others, the startup is poised to influence how next-gen AI systems think, not just by processing more data, but by learning how to reason better.

For developers and businesses, Cogito v2 offers a compelling alternative: high-performance AI that evolves with use. Whether fine-tuning for niche applications or deploying at scale, these models represent a shift toward intuitive, cost-efficient machine intelligence. The future of AI may not lie in bigger models, but in smarter ones.

(Source: VentureBeat)

Topics

deep cogitos new llms 95% model variants parameters 90% iterated distillation amplification ida 88% hybrid reasoning architecture 87% mixture- -experts moe models 85% self-improving intuition 85% Performance Benchmarks 82% cost-effective training 80% enterprise developer applications 78% future ai reasoning 75%