Instructions to use SeeSee21/AniSee with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use SeeSee21/AniSee with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("SeeSee21/AniSee", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
- π¨ AniSee
- πΌοΈ Preview Gallery
- β¨ What is AniSee?
- π― Key Features
- πΊοΈ AniSee Roadmap
- βοΈ AniSee Base β Recommended Settings
- π Resolution Guide
- π‘ Prompting Guide
- β Recommended Positive Prefix
- β Recommended Negative Prompt
- π‘οΈ Safety Tags
- π§ Installation
- π§© Official Workflow
- π Repository Structure
- π Version History
- π Links
- π Credits
- π License
- β€οΈ Notes
- πΌοΈ Preview Gallery
π¨ AniSee
Personal Anime Fine-Tune of Anima Preview3 Base
Full Fine-Tune β’ Clean Anime Aesthetics β’ Tag + Natural Language β’ Anima-Compatible
Diffusion Model β’ 1 MP Native β’ LoRA-friendly
πΌοΈ Preview Gallery
Browse the full curated set of sample images on the dedicated gallery page:
β¨ What is AniSee?
AniSee is a personal full fine-tune of CircleStone Labs' Anima Preview3 Base, retrained on my own curated dataset to push the model further into a cleaner, more focused anime aesthetic.
It is not a LoRA merge β AniSee is a full fine-tune (~20K steps) with the LLM adapter only very lightly co-trained, following the official Anima fine-tuning guidelines. The goal is to keep everything that makes Anima a strong illustration base β Danbooru tags, natural language prompts, mixed prompts, full Qwen text encoder, Qwen-Image VAE β while shifting the default style toward a stronger anime look in line with my other checkpoints.
AniSee is mainly intended for:
- Anime-style illustrations
- Character-focused images
- Cleaner anime aesthetics
- Style experiments
- Testing Anima-based fine-tunes inside ComfyUI
This is the first release β only the Diffusion Model variant for now. If testing goes well, an AIO version and a 4-Step Turbo version (based on the new CDM β Continuous-Time Distribution Matching distillation method) will follow.
π― Key Features
- β Full fine-tune on Anima Preview3 Base β not a LoRA merge
- β ~20K training steps on a curated anime dataset
- β Clean, focused anime aesthetics
- β Supports Danbooru-style tags, natural language, and mixed prompts
- β Compatible with the standard Anima ComfyUI workflow
- β
Drop-in replacement for
anima-preview3-base.safetensors - β Uses the existing Qwen text encoder + Qwen-Image VAE β included in the repo
- β LoRA training friendly β same base architecture as Anima
πΊοΈ AniSee Roadmap
β Released
π¨ AniSee Base
Full fine-tune of Anima Preview3 Base β Diffusion Model variant. This is the foundation of the AniSee family.
π Planned
π¦ AniSee AIO
All-in-one checkpoint with Diffusion Model + Qwen Text Encoder + Qwen-Image VAE integrated into a single file. Single-file convenience, no extra loaders needed.
π AniSee Turbo (4-Step, CDM)
If testing of the Base goes well, a 4-Step Turbo variant distilled with the brand-new Continuous-Time Distribution Matching (CDM) method (Liu et al., 2026). CDM migrates the DMD framework from discrete anchoring to continuous optimization, achieving state-of-the-art few-step generation without GAN or reward-model auxiliary objectives. Should give clean 4-step anime generations with strong fine details.
π Paper: https://byliutao.github.io/cdm_page/
π§ Official AniSee ComfyUI Workflow
A dedicated workflow with the auto-prefix, optional Qwen3-VL prompt enhancer, LoRA support, and Ultimate SD Upscale is already included in this repo under workflows/AniSee.json.
More updates coming as testing progresses! π¨
βοΈ AniSee Base β Recommended Settings
The settings I personally use and recommend as a starting point:
Steps: 40
CFG: 4.5
Sampler: er_sde
Scheduler: simple
Resolution: ~1 MP # e.g. 1024Γ1024, 896Γ1152, 1152Γ896
CFG Guide: 4.0β5.0 is the sweet spot for balanced quality and creativity. Going above 5.0 starts to risk burning the image, especially with heavy quality tags. If results feel too harsh, drop CFG slightly or reduce quality tag count.
Sampler alternatives (all work well, just different character):
| Sampler / Scheduler | Character |
|---|---|
er_sde + simple (default) |
Neutral style, flat colors, sharp lines |
euler_a |
Softer, thinner lines, slightly more 2.5D feel, tolerates higher CFG |
dpmpp_2m_sde_gpu |
Similar to er_sde but more "creative", can get wild on short prompts |
Feel free to experiment β these are just starting points, not hard rules.
π Resolution Guide
| Use Case | Resolution |
|---|---|
| β Square / General purpose | 1024 Γ 1024 |
| Portrait / Character art | 896 Γ 1152 |
| Landscape / Scenes | 1152 Γ 896 |
| Wider cinematic | 1254 Γ 836 |
| Widescreen | 1365 Γ 768 |
Stay around 1 MP for the cleanest results. The Anima base starts breaking down somewhere around 2 MP, so if you want bigger images, generate at 1 MP first and upscale afterwards.
π‘ Prompting Guide
AniSee inherits Anima's prompting system. It accepts:
- Danbooru / anime-style tags
- Natural language prompts
- Mixed prompts (tags + sentences)
A good prompt structure:
[quality tags] [meta tags] [safety tag] [subject (1girl/1boy/etc)]
[character] [appearance] [pose] [clothing] [background] [lighting] [style]
Important tag rules (inherited from Anima):
- Use lowercase for tags, spaces instead of underscores
- Score tags are the only tags that use underscores (
score_7, etc.) - Artist tags must be prefixed with
@β e.g.@artistname
β Good (mixed prompt)
masterpiece, best quality, score_7, highres, illustration, safe, 1girl,
long silver hair, blue eyes, black hoodie, standing in a rainy city street
at night, neon lights reflecting on wet asphalt, cinematic lighting,
detailed anime illustration
β Good (natural language)
masterpiece, best quality, score_7, highres, illustration.
A young anime girl with long silver hair and golden eyes, wearing a
traditional shrine maiden outfit with white haori and red hakama.
She stands in a sunlit bamboo forest, cherry blossoms falling softly
around her. Warm afternoon light filtering through the trees,
detailed fabric shading, calm serene expression.
β Avoid
Very short tag dumps like anime girl, silver hair, hoodie β the model can produce unexpected results when the prompt is too sparse. Aim for at least a few descriptive tags or 2+ sentences.
β Recommended Positive Prefix
Start every prompt with:
masterpiece, best quality, score_7, highres, illustration,
Then add your subject, character, scene, and style tags after that.
You can also experiment with other quality tag combinations:
masterpiece, best quality, score_7, safe(Anima default)masterpiece, best quality, score_8, highres, official artscore_9, masterpiece, absurdres, anime screenshot
But the prefix above is what I personally use and recommend as a starting point.
β Recommended Negative Prompt
This is the negative I run with β it cleans up most common issues without being so aggressive that it kills the style:
worst quality, low quality, score_1, score_2, score_3, artist name,
(lowres:1.2), (worst quality:1.4), (low quality:1.4), (bad anatomy:1.4),
bad hands, multiple views, comic, jpeg artifacts, patreon logo,
patreon username, web address, signature, watermark, artist name,
censored, mosaic censoring
If your images come out too flat or lose style, reduce the weights on the heavier terms (e.g. drop (low quality:1.4) back to low quality).
π‘οΈ Safety Tags
Inherited from Anima. Use one of these in the positive prompt:
safeβ for normal generations (recommended default)sensitivensfwexplicit
π§ Installation
Step 1 β Download the files
You need three files (all included in this repo):
anisee.safetensorsβ the modeltext_encoders/qwen_3_06b_base.safetensorsβ text encodervae/qwen_image_vae.safetensorsβ VAE
Step 2 β Place the files
ComfyUI/models/diffusion_models/
βββ anisee.safetensors
ComfyUI/models/text_encoders/
βββ qwen_3_06b_base.safetensors
ComfyUI/models/vae/
βββ qwen_image_vae.safetensors
If you already run Anima Preview3 Base, you already have the text encoder and VAE β AniSee is a direct drop-in.
Step 3 β Load in ComfyUI
Use the standard Anima workflow, or the official AniSee workflow from workflows/anisee-workflow-SDUltimate.json:
- Load Diffusion Model β
anisee.safetensors - Load Text Encoder β
qwen_3_06b_base.safetensors - Load VAE β
qwen_image_vae.safetensors
Then your usual sampler, encode, decode, save chain.
π§© Official Workflow
A ready-to-use ComfyUI workflow is included at workflows/anisee-workflow-SDUltimate.json.
It features:
- π¦ Model + Text Encoder + VAE loaders pre-configured
- π Auto Quality Prefix β no need to type
masterpiece, best quality, score_7, ...yourself - π² Optional Qwen3-VL Prompt Enhancer β converts short one-liners into full Danbooru tag lists
- π Optional LoRA stack via Lora Manager (one-click toggle)
- πΌ Optional UltimateSDUpscale 2Γ with side-by-side compare
- π¨ Pre-configured with
er_sde/simple/ 40 steps / CFG 4.5 - β Pre-loaded recommended negative prompt
- π Built-in MarkdownNote with all settings + quick reference
Required custom nodes (all installable via ComfyUI Manager):
For the optional 2Γ upscaler, also place 4x-UltraSharp.pth in ComfyUI/models/upscale_models/:
π Repository Structure
AniSee/
βββ README.md
βββ config.json
β
βββ anisee.safetensors # the model (~4.18 GB)
β
βββ text_encoders/
β βββ qwen_3_06b_base.safetensors # text encoder (same as Anima)
β
βββ vae/
β βββ qwen_image_vae.safetensors # VAE (same as Anima)
β
βββ images/
β βββ cover.png # social preview / model cover
β βββ anisee-workflow-cover.png # workflow preview image
β βββ 1.png 2.png 3.png 4.png
β βββ 5.png 6.webp 7.webp 8.png
β βββ 9.png 10.png 11.webp 12.webp
β βββ ...
β
βββ workflows/
βββ anisee-workflow-SDUltimate.json
π Version History
v1.0 β Initial Release
- AniSee Base β full fine-tune of Anima Preview3 Base
- ~20K training steps on a curated anime dataset
- LLM adapter only very lightly co-trained (following Anima's fine-tuning guidelines)
- Diffusion Model variant (single
.safetensorsfile) - Compatible with the standard Anima ComfyUI workflow
- Drop-in replacement for
anima-preview3-base.safetensors - Includes the official ComfyUI workflow with auto quality prefix + Qwen3-VL prompt enhancer
π Links
- CivitAI Page: civitai.red/models/2628747/anisee
- Example Gallery: anisee.anisee.workers.dev
- Base Model: circlestone-labs/Anima
- CDM Paper (planned Turbo variant): byliutao.github.io/cdm_page
- Author: SeeSee21 on Hugging Face
π Credits
- Base Model: Anima Preview3 Base by CircleStone Labs and Comfy Org
- Underlying Architecture: Built on NVIDIA Cosmos-Predict2-2B (Anima is a "Derivative Model")
- Fine-Tune: SeeSee21
- CDM Distillation Method (planned Turbo variant): Continuous-Time Distribution Matching for Few-Step Diffusion Distillation β Liu et al., 2026
- Workflow Custom Nodes: yolain, ssitu, Will Miao, AILab (1038lab), rgthree
π License
AniSee inherits the CircleStone Labs Non-Commercial License from Anima. The model and derivatives are usable only for non-commercial purposes. As a derivative of Cosmos-Predict2-2B-Text2Image, the NVIDIA Open Model License Agreement also applies insofar as it covers Derivative Models.
For commercial licensing of the base model, please contact CircleStone Labs at tdrussell@circlestone.ai.
β€οΈ Notes
AniSee is a personal anime-focused fine-tune of Anima Preview3 Base, built to bring a stronger anime look and visual direction in line with my other checkpoints.
It is still in active testing β the AIO and 4-Step Turbo (CDM) variants will follow once the Base has been validated in the wild.
AniSee β clean anime, built on Anima. π¨
- Downloads last month
- 8
Model tree for SeeSee21/AniSee
Base model
circlestone-labs/Anima










