Cursor Introduces Composer 2.5

cursor.com

157 points by asar 18 hours ago


https://twitter.com/cursor_ai/status/2056415413077233983

luodaint - 8 minutes ago

Benchmarks measure turn-level capabilities: you feed a task into the system and then grade the result. Capability for production-level usage concerns session-level decision making: does the agent know when to stop editing, retain the right amount of context, or go back and reread the file if the state has changed?

This is not a property of the model, but a property of the discipline; it can be operationalized by what you have documented before the session begins. Without "stop editing where you can no longer follow your changes to the spec" and "go back and read the migration file before changing the schema," there is nothing to halt the process until it fails integration.

Those teams who get consistent results independent of the model being used typically do so because they have operationalized their discipline first. Those switching out models monthly tend to expect the model to supply them.

throwaw12 - 3 hours ago

> Composer 2.5 is built on the same open-source checkpoint as Composer 2, Moonshot's Kimi K2.5.

Really nice to see they're giving credit to the company and I am optimistic Kimi K open models soon will outperform Opus models

goyozi - 4 hours ago

I kind of want to try it, to see if and how far they can take an open model and improve it but I really don’t miss the Cursor user experience. Constant UI changes, half-baked features, smaller and smaller limits, useless AI change attribution; I think I’ll wait for others to report if it’s any good.

memoryleakgame - 13 hours ago

If these benches from their site hold up (they likely wont)

Wouldn't this compress ai revenue like 15x quickly

If they really have a 4.7 opus high equivalent at 1/16 the cost wouldn't this significantly effect all the current capex and planing

Maybe they are getting elon to cover cost

zurfer - an hour ago

Kudos to the team. Please consider making the model available via API!

- an hour ago
[deleted]
asar - 17 hours ago

The model is (like Composer 2) based on Kimi K2.5 and they claim SOTA performance for 1/10th of the cost. The tweet also mentions that they've started a new model from scratch on Colossus 2 (xAI/SpaceX Cluster). Really impressive how they've made this jump from being called the vscode fork with no moat just a couple of months ago.

PUSH_AX - 17 hours ago

They set themselves up for flack when they use whatever these evals are… they did the same for composer 2 which was evaled in close competition with frontier models, spoiler alert, it wasn’t even close in practice.

So now 2.5 is supposed to compete with opus 4.7? Sure…

jtwaleson - 16 hours ago

Ok this might be weird but I've moved everyone in my 4 person team to our team plan and costs seem to have sky rocketed compared to the individual plans. Where before most people spent 20-100 USD, now the total bill is more like 1k USD. I haven't gone into the details but it feels like I'm being scammed.

m_mueller - 5 hours ago

It's a bit confusing to me why they'd make this 'fast' version the default, as it appears to be much more expensive than Composer 2. Wasn't it supposed to be a very cheap alternative to SOTA models?

everfrustrated - 17 hours ago

Full details https://cursor.com/blog/composer-2-5

I_am_tiberius - 2 hours ago

I hope people soon wake up to the fact that they use user data for model fine tuning.

bingud - 2 hours ago

Seems like a promising and useful model but its probably scary how much customer data they fed into it to reach this performance

try-working - 2 hours ago

A lot of people saying Cursor have no moat. Sure. Neither do OpenAI or Anthropic.

granzymes - 14 hours ago

Surprised this got pushed off the front page so quickly! It’s exciting to see what the Cursor team has been able to do with significantly fewer resources than the frontier labs.

I do wish they weren’t joining xAI. Something tells me there will be a contingent of researchers that departs Cursor if that merger is consummated.

uf00lme - 5 hours ago

I wonder why they didn’t train off Kimi 2.6, I hope is it because they already had a good base and not that they messed up that relationship.

- 5 hours ago
[deleted]
big-chungus4 - 5 hours ago

Can you please train Qwen 3.5 like 0.8B to 9B using the same training techniques

DeathArrow - 2 hours ago

I think anybody will be much better by acquiring a coding plan from Kimi.com and using Kimi K2.6, with whatever harness they like, including Claude Code, instead of paying more for Cursor's version of Kimi K2.5.

vanuatu - 16 hours ago

It's always great that more companies are throwing their hat in the ring, especially focusing on value (latency + intelligence + cost)

Dongyu_Jia - 2 hours ago

Will this be the cursor's last dance? LoL

lukebrichey - 14 hours ago

this feels super bullish on cursor/spacexai's ability to train a frontier level model. could be truly SOTA on coding given that their RL data is this powerful

jdlyga - 17 hours ago

It's a bit odd that they're not comparing it against Sonnet

polski-g - 13 hours ago

I don't know why their model isn't on Openrouter yet. They must not have enough capacity to offer it.

svclaws - 17 hours ago

Their previous Composer was already marketed as a cheap model capable of competing with SOTA on most tasks. The evals they shared back then backed this up but in my day-to-day usage it fell short across the board. Canceled my cursor subscription and switched to Claude Code a few weeks ago. It has its own shortcomings but in terms of model capability and UX quality Cursor will have a hard time competing in the long term. Elon Musk will be a very good way out for them.

ChrisArchitect - 17 hours ago

Non-x link: https://cursor.com/blog/composer-2-5 (https://news.ycombinator.com/item?id=48182126)

sergiotapia - 17 hours ago

Congratulations on the launch! I'm interested in trying Cursor but it's very confusing what I should buy. What does the Pro $20 plan get me in usage if I only use Composer 2.5? How fast is the model?

re-thc - 16 hours ago

Did they just upgrade Kimi 2.5 to 2.6?

scuderiaseb - 17 hours ago

[dead]

contextcost - 4 hours ago

[dead]

SadErn - 3 hours ago

[dead]