Krea 2 Technical Report
krea.ai
111 points by mattnewton a day ago
Hi HN, we're releasing weights for our latest text-to-image model and publishing this in-depth writeup on how we trained it.
I hope there is something in the report for everyone. We included a fair bit on the actual training and data infrastructure, which usually isn't written about much and which I think will be interesting to people here. There's more that didn't fit, so I'm happy to answer questions!
This is a massive technical report for an open-weights image gen model. As someone who has followed this space closely, it’s really cool to read about the behind-the-scenes experimentation and effort that went into the final product. I hope you will release some of the fine-tuning tools so the community can experiment with them as well and really push what the model’s capable of.
You can find some links and details in the GitHub readme for finetuning / LoRA support. Ostris, musubi tuner, fal and Hugging Face diffusers are all day-0 supported :) https://github.com/krea-ai/krea-2
We recommend training on the undistilled Raw checkpoint and then applying the resulting LoRA to the Turbo model for inference.
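For anyone wondering why a LoRA trained on one checkpoint can be reused on another: a LoRA is a low-rank weight delta, so it can be added onto any checkpoint whose layers have the same shapes. Here is a toy sketch of that idea in plain Python. It assumes the two checkpoints share an architecture, and every name and number in it is made up for illustration; none of it comes from the Krea 2 code or weights.

```python
# Toy illustration only: names, shapes, and values are hypothetical and are
# not taken from the Krea 2 release.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def apply_lora(weight, lora_down, lora_up, alpha):
    """Return weight + (alpha / rank) * (lora_up @ lora_down); weight is not mutated."""
    rank = len(lora_down)          # lora_down has shape (rank, in_features)
    scale = alpha / rank
    delta = matmul(lora_up, lora_down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]

# A 2x2 layer standing in for a distilled ("Turbo"-like) checkpoint.
turbo_weight = [[1.0, 0.0], [0.0, 1.0]]

# A rank-1 adapter, imagined as trained against the undistilled checkpoint.
lora_down = [[1.0, 2.0]]       # shape (rank, in_features)
lora_up = [[0.5], [0.25]]      # shape (out_features, rank)

patched = apply_lora(turbo_weight, lora_down, lora_up, alpha=1.0)
print(patched)
```

Whether the delta transfers well in practice depends on how far the distilled weights have drifted from the undistilled ones, which this sketch does not model.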
Interesting item on the careers page, btw. For anyone who knows what old-school Mellanox was about, it might be your kind of thing: https://jobs.ashbyhq.com/krea/ebe94024-eef6-4306-a019-10072a... :D
Turbo appears GGUF'd already: https://huggingface.co/Abiray/Krea-2-Turbo-GGUF
It's a good model; sadly, the use of the Qwen VAE is a bit of a downer.
Some people have mentioned that using the Wan2.1 VAE instead solves this. I haven't personally had time to try it yet.