
JetBrains
A suite of intelligent development tools
4.9•72 reviews•1.9K followers
A suite of intelligent development tools
4.9•72 reviews•1.9K followers
Powerful IDEs for most programming languages and technologies along with products for team collaboration.
This is the 27th launch from JetBrains. View more
Mellum by JetBrains
Launching today
Meet Mellum, a family of fast language models, including a next-generation model for ultra-low-latency and high-performance inference.



Launch Team





Latency-first models are underrated. I run AI voice agents and on a live phone call latency isn't a nice-to-have, it's the whole UX — a 2-second pause feels broken to a caller in a way it never does inside an IDE. For narrow, well-scoped tasks I'll take fast-and-good-enough over slow-and-brilliant every time. Is Mellum something you'd consider for real-time / voice use cases, or is it squarely aimed at the coding loop?
Shipping a focused, smaller coding model as open weights is the interesting bet here — the frontier-model-for-everything approach is expensive and overkill for completion. What I'd want to know: what context window does Mellum practically use for repo-level completion, and is it trained for fill-in-the-middle specifically, or general next-token? FIM quality is usually what separates a good in-IDE model from a chat model bolted into an editor.
How does it compare with the Qwen 3.6 and Gemma 4 models? It's disappointing to only see the old models. It seems misleading.
Build Check
Yeah! Workflow performance became key and this is bringing a clear advantage there. @fmerian doing what Flo does! The real hunting goat!