Ideogram 4 GGUF

Quantized GGUF diffusion transformer weights for Ideogram 4, converted from the original FP8 release for use with ComfyUI GGUF loader nodes.

This repository contains GGUF files for the two Ideogram 4 diffusion components:

  • ideogram4-transformer-*.gguf: the main text-guided diffusion transformer.
  • ideogram4-unconditional_transformer-*.gguf: the unconditional transformer used by CFG workflows.

These files are not a complete standalone Ideogram 4 package. Your workflow still needs the other runtime assets expected by Ideogram 4 in ComfyUI, such as the text or multimodal encoder components and VAE.

ComfyUI Support

Use these models with the ComfyUI nodes from molbal/ComfyUI-GGUF. Install that custom node repository into your ComfyUI custom_nodes folder, then restart ComfyUI.

Place the downloaded .gguf files in one of ComfyUI's diffusion model folders:

ComfyUI/models/diffusion_models/
ComfyUI/models/unet/

Load the files with Unet Loader (GGUF) or Unet Loader (GGUF/Advanced) in an Ideogram 4 workflow that accepts separate main and unconditional diffusion models.

Files

Each quant level is published for both the main transformer and the unconditional transformer.

Quant Main transformer Unconditional transformer Approx. size per file
Q4_0 ideogram4-transformer-q4_0.gguf ideogram4-unconditional_transformer-q4_0.gguf 5.64 GB
Q4_1 ideogram4-transformer-q4_1.gguf ideogram4-unconditional_transformer-q4_1.gguf 6.21 GB
Q5_0 ideogram4-transformer-q5_0.gguf ideogram4-unconditional_transformer-q5_0.gguf 6.77 GB
Q5_1 ideogram4-transformer-q5_1.gguf ideogram4-unconditional_transformer-q5_1.gguf 7.33 GB
Q8_0 ideogram4-transformer-q8_0.gguf ideogram4-unconditional_transformer-q8_0.gguf 10.14 GB

The main and unconditional models do not need to use the same quant level.

Suggested Pairings

Main transformer Unconditional transformer Notes
q8_0 q8_0 Highest precision GGUF pair in this repo.
q8_0 q5_1 Keeps the main transformer high precision while reducing memory on the unconditional side.
q8_0 q4_1 Larger quality bias toward the main transformer with lower CFG-side memory.
q5_1 q4_1 Balanced quality and size.
q5_0 q4_1 Lower memory starting point.
q4_0 q4_0 Smallest available GGUF pair.

Inference Measurements

Peak Memory

The chart below shows peak RAM and VRAM measured during Ideogram 4 inference with different main/unconditional quant pairings, including an NVFP4 baseline for comparison.

Peak memory usage of GGUF quants and NVFP4

Relative Inference Speed

The chart below compares relative per-iteration inference speed across quant pairings, with the NVFP4 + NVFP4 run used as the 100% reference in the chart.

Ideogram 4 inference speed among various quantization combinations

Measurements were taken by u/molbal on Windows 11 with an AMD Ryzen 7 6800H CPU, 48 GB RAM, and an RTX 3080 Laptop GPU with 8 GB VRAM. Exact memory use and speed will vary by workflow, image size, sampler settings, ComfyUI version, and loaded auxiliary models.

Download

Download the two files you want to pair from the Files tab, or use the Hugging Face CLI. For example:

huggingface-cli download molbal/ideogram-4-gguf ideogram4-transformer-q5_1.gguf ideogram4-unconditional_transformer-q4_1.gguf --local-dir ComfyUI/models/diffusion_models

Compatibility Notes

These are non-K GGUF quantizations intended for PyTorch dequantization in ComfyUI. K-quants are not included because this ComfyUI loading path does not use fused quantized linear kernels.

If a workflow fails to load these files, update molbal/ComfyUI-GGUF and confirm that both the main and unconditional transformer files are present in a ComfyUI diffusion model folder.

License

These files are derived from ideogram-ai/ideogram-4-fp8 and follow the Ideogram 4 non-commercial license.

Downloads last month
616
GGUF
Model size
9B params
Architecture
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for molbal/ideogram-4-gguf

Quantized
(10)
this model