Codestral 25.01

mistral.ai

20 points by birriel a day ago


22c - 21 hours ago

256k context window and FIM support sounds good. I can't see it mentioned on the release page how big the model is, they say sub-100B but they are comparing it to 22B and the 22B seems to hold up quite well against it. If we're talking 80B vs 22B then it seems like quite a heavy-weight model.