
JetBrains Launches Mellum: Open AI Coding Model

Summary

– JetBrains has introduced Mellum, a 4-billion-parameter AI model for code generation focused on code completion, trained on over 4 trillion tokens and available on Hugging Face.
– Mellum requires customization for specific tasks and is not a plug-and-play solution; pre-tuned versions for Python are available as proof-of-concept.
– The model was trained using diverse sources, including GitHub and Wikipedia, over 20 days with 256 Nvidia H200 GPUs, but may inherit biases and security risks.
– JetBrains emphasizes Mellum as a foundation for innovation, not a finished product, and has open-sourced it under an Apache 2.0 license to encourage collaboration.
– Mellum offers potential benefits for developers but requires careful oversight, highlighting the need for human review of AI-generated code.

JetBrains has unveiled Mellum, its first publicly available AI model tailored for code generation, marking a significant step in AI-assisted software development. The model, now accessible on Hugging Face, specializes in code completion—predicting and finishing code snippets based on context—and was trained on a massive dataset exceeding 4 trillion tokens. With 4 billion parameters, Mellum demonstrates robust problem-solving capabilities, though its true potential emerges only after fine-tuning for specific programming tasks.
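
To give a feel for how a completion-focused model like this is typically exercised, the sketch below loads the checkpoint with Hugging Face's transformers library and asks it to finish a partial function. The repository id, prompt, and generation settings are illustrative assumptions, not details confirmed by JetBrains.

```python
# Minimal sketch: code completion with Mellum via Hugging Face transformers.
# The repo id "JetBrains/Mellum-4b-base" is an assumption; check the hub for
# the exact model name before running.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "JetBrains/Mellum-4b-base"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Give the model a partial function and let it predict the continuation.
prompt = "def fibonacci(n):\n    "
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=48, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Greedy decoding is used here only to keep the example deterministic; an IDE integration would typically stream shorter completions with tighter latency budgets.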

Unlike plug-and-play solutions, Mellum requires customization before deployment. JetBrains has shared pre-tuned versions for Python, though these serve primarily as proof-of-concept demonstrations rather than production-ready tools. The company emphasizes that Mellum is designed for integration into developer environments, AI coding assistants, and academic research, offering flexibility for experimentation and education.
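
Since JetBrains positions Mellum as a base model that needs task-specific tuning before deployment, the following sketch shows the general shape such a fine-tuning pass could take with Hugging Face's Trainer. The model id, hyperparameters, and the toy two-snippet corpus are all illustrative assumptions; a real run would use a project- or language-specific dataset and far more compute.

```python
# Minimal sketch of task-specific fine-tuning for a causal code model,
# assuming the repo id "JetBrains/Mellum-4b-base" and a toy in-memory corpus.
import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

model_id = "JetBrains/Mellum-4b-base"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Toy corpus standing in for a real project-specific training set.
snippets = [
    "def add(a, b):\n    return a + b\n",
    "def greet(name):\n    return f'Hello, {name}!'\n",
]

class SnippetDataset(torch.utils.data.Dataset):
    """Wraps raw code strings as causal-LM training examples."""
    def __init__(self, texts):
        self.encodings = [
            tokenizer(t, truncation=True, max_length=256, return_tensors="pt")
            for t in texts
        ]
    def __len__(self):
        return len(self.encodings)
    def __getitem__(self, idx):
        ids = self.encodings[idx]["input_ids"].squeeze(0)
        # For causal LM training, labels are the input ids themselves.
        return {"input_ids": ids, "labels": ids.clone()}

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="mellum-finetuned",
                           per_device_train_batch_size=1,
                           num_train_epochs=1),
    train_dataset=SnippetDataset(snippets),
)
trainer.train()
```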


The model’s training leveraged diverse sources, including GitHub’s permissively licensed code and English Wikipedia articles, processed over 20 days using a cluster of 256 Nvidia H200 GPUs. While powerful, JetBrains acknowledges limitations: Mellum may inherit biases from public repositories, and its outputs aren’t guaranteed to be secure or bug-free. This aligns with broader industry concerns—a 2023 Snyk survey found that over half of organizations face security risks from AI-generated code.

Despite these challenges, JetBrains views Mellum as a foundation for innovation rather than a finished product. The company’s blog post underscores a focused approach: “We’re not chasing generality—we’re building focus.” By open-sourcing the model under an Apache 2.0 license, JetBrains invites collaboration, hoping to spur advancements in code understanding, generation, and tooling.

For developers, Mellum represents both an opportunity and a cautionary tale. While it accelerates coding workflows, its outputs demand scrutiny—a reminder that AI assistance in software development still requires human oversight. As JetBrains puts it, “If Mellum sparks even one meaningful experiment, we’d consider it a win.”

(Source: TechCrunch)

