Codestral 25.01
mistral.ai20 points by birriel a day ago
20 points by birriel a day ago
256k context window and FIM support sounds good. I can't see it mentioned on the release page how big the model is, they say sub-100B but they are comparing it to 22B and the 22B seems to hold up quite well against it. If we're talking 80B vs 22B then it seems like quite a heavy-weight model.