Krea 2 (K2) Turbo - Diffusers
Diffusers-format conversion of the Krea 2 Turbo checkpoint from Krea. Turbo is an 8-step distilled model built for fast, high-quality text-to-image generation, and is the checkpoint recommended for inference.
LoRAs trained on the undistilled Krea-2-Base-Diffusers apply directly to Turbo, so the recommended workflow is to train on Base and run on Turbo.
Model Summary
Krea 2 is a latent-diffusion image model trained from scratch with an emphasis on aesthetics and stylistic control. The architecture is a single-stream multimodal diffusion transformer.
- Transformer: single-stream DiT, 12.9B parameters, 28 blocks at width 6144. Grouped-query attention, a learned output gate, per-head QK normalization, and a 3-axis rotary embedding. A text-fusion stage inside the transformer collapses twelve text-encoder hidden-state layers into one conditioning stream.
- Text encoder: Qwen/Qwen3-VL-4B-Instruct, tapped at twelve intermediate layers (text-only conditioning).
- VAE: the Qwen-Image autoencoder (AutoencoderKLQwenImage, f8, 16 latent channels).
- Sampler: flow matching with a fixed timestep shift.
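To make the VAE figures above concrete, here is a minimal sketch of the latent-grid arithmetic implied by "f8, 16 latent channels". It assumes "f8" means an 8x spatial downsampling factor and ignores the batch dimension and any extra axes the autoencoder may add. The helper name `latent_shape` is hypothetical and not part of any library.

```python
def latent_shape(height, width, factor=8, channels=16):
    """Return (channels, latent_height, latent_width) for an image size.

    Assumes the VAE downsamples each spatial axis by `factor` (f8 -> 8)
    and emits `channels` latent channels, as stated in the model summary.
    """
    if height % factor or width % factor:
        raise ValueError(f"height and width must be multiples of {factor}")
    return (channels, height // factor, width // factor)


# The two ends of the resolution range quoted on this card.
print(latent_shape(1024, 1024))  # (16, 128, 128)
print(latent_shape(2048, 2048))  # (16, 256, 256)
```

Under these assumptions, going from 1024 x 1024 to 2048 x 2048 quadruples the number of latent positions, which is worth keeping in mind for memory and latency.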
Weights are stored in their original mixed precision (bf16 matmuls, fp32 norms and modulations).
The transformer config carries is_distilled: true, so guidance is disabled automatically.
Recommended Settings
Turbo is distilled for few-step sampling and runs without classifier-free guidance.
| Setting | Value |
|---|---|
| Steps | 8 |
| Guidance (CFG) | 0 (disabled) |
| Resolution | 1024 x 1024 up to 2048 x 2048 |
The timestep shift is pinned (mu = 1.15), matching the distillation schedule.
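As an illustration of what a pinned shift does to an 8-step schedule, the sketch below applies the exponential time-shift form commonly used with flow-matching samplers, `exp(mu) / (exp(mu) + (1/t - 1))`. That Krea 2 uses exactly this formula is an assumption; the card only states the value mu = 1.15. The function names are hypothetical.

```python
import math


def shift_time(t, mu=1.15):
    """Map a uniform time t in [0, 1] to a shifted time (assumed formula)."""
    if t <= 0.0:
        return 0.0  # avoid dividing by zero at the clean-image end
    return math.exp(mu) / (math.exp(mu) + (1.0 / t - 1.0))


def shifted_schedule(num_steps=8, mu=1.15):
    """Return num_steps + 1 shifted times running from 1.0 (noise) down to 0.0."""
    return [shift_time(1.0 - i / num_steps, mu) for i in range(num_steps + 1)]


for t in shifted_schedule():
    print(f"{t:.4f}")
```

With a positive mu, the shifted times sit above the uniform ones, so more of the 8 steps are spent at high noise levels. In practice the scheduler shipped with the checkpoint handles this; the sketch is only meant to show the shape of the schedule.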
Prompting
Natural-language prompts are recommended. Long, detailed descriptions yield the best results, though short prompts also produce strong images. For text rendering, wrap the words to be rendered in quotes. An optional prompt-expansion system prompt is available in the upstream krea-2-oss repository.
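The prompting advice and the recommended settings can be combined as in the following sketch. It assumes the checkpoint loads through the generic `DiffusionPipeline` entry point and that the pipeline call accepts the usual `num_inference_steps`, `height`, and `width` arguments; the Krea 2 pipeline signature has not been verified here. `build_prompt` and `generate` are hypothetical helpers, and no guidance argument is passed because the config is said to disable guidance automatically.

```python
MODEL_ID = "CalamitousFelicitousness/Krea-2-Turbo-Diffusers"

# Recommended settings from this card: 8 steps, 1024 x 1024.
SETTINGS = {"num_inference_steps": 8, "height": 1024, "width": 1024}


def build_prompt(description, rendered_text=None):
    """Return a natural-language prompt, quoting any text to be rendered."""
    if rendered_text:
        return f'{description}, with the text "{rendered_text}"'
    return description


def generate(prompt, device="cuda"):
    """Load the pipeline and generate one image (assumed call signature)."""
    # Imported lazily so the helpers above work without a GPU stack installed.
    import torch
    from diffusers import DiffusionPipeline

    pipe = DiffusionPipeline.from_pretrained(
        MODEL_ID,
        dtype=torch.bfloat16,
        device_map=device,  # use "mps" on Apple devices
    )
    return pipe(prompt, **SETTINGS).images[0]
```

A call such as `generate(build_prompt("A neon sign above a quiet cafe at night", "OPEN LATE")).save("out.png")` would then write the result to disk, assuming the pipeline returns PIL images as most Diffusers pipelines do.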
License
The weights are released under the Krea 2 community license.
Citation
@misc{krea2,
title = {Krea 2},
author = {Krea},
year = {2026},
url = {https://www.krea.ai/krea-2}
}