Comparison SAFETENSORS FP8+ vs GGUF (Q8-Q2) vs NVFP4

Quality comparison between multiple precision formats

Comparisons between the formats, using a mixture of my models.

The comparison videos within each batch are made with the same seeds and prompts.

Quantization types and quality estimation

There are many types of quantization. The most common "high-precision" formats are FP32 (full precision), BF16 (Brain Float 16), and FP16 (half precision). For more aggressive compression we use FP8 (plain or mixed/scaled) and 4-bit formats such as NF4 (NormalFloat) or NVFP4 (NVIDIA's specialized 4-bit float).
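As a rough illustration of what bit width costs, here is a minimal sketch (plain Python, not any library's actual kernel) of uniform symmetric quantization, showing how round-trip error grows as precision drops:

```python
# Illustrative sketch only: quantize a few weights to a signed integer grid
# of a given bit width, then dequantize, and compare against the originals.

def quantize_roundtrip(weights, bits):
    """Quantize to signed integers of the given bit width, then dequantize."""
    qmax = 2 ** (bits - 1) - 1              # e.g. 127 for 8-bit, 7 for 4-bit
    scale = max(abs(w) for w in weights) / qmax
    return [round(w / scale) * scale for w in weights]

weights = [0.82, -0.35, 0.07, -1.20, 0.55]
for bits in (8, 4, 2):
    restored = quantize_roundtrip(weights, bits)
    err = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"{bits}-bit max round-trip error: {err:.4f}")
```

At 8 bits the grid is fine enough that the error is visually negligible; at 2 bits every weight snaps to one of only three values, which is why Q2-class models hallucinate.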

GGUF is a binary container format, but it is not "just" a box. While Safetensors usually stores raw weights (such as FP16 or BF16), except in mixed-format checkpoints, GGUF is designed to store quantized weights using specific schemes like K-Quants (e.g., Q4_K_M) or I-Quants (importance-matrix quants). These quantization methods, developed for llama.cpp, allow "mixed-bit" storage, where different layers of the model are compressed at different intensities.
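The core idea behind GGUF's block quantization can be sketched with its simplest type, Q8_0. This is a simplification: real ggml stores the per-block scale as fp16 and packs blocks into a binary buffer, and the helper names here are made up for illustration; the block size of 32 does match ggml's Q8_0.

```python
import random

BLOCK = 32  # ggml's Q8_0 quantizes weights in blocks of 32

def q8_0_block(block):
    """One scale per block; each weight is stored as a signed 8-bit integer."""
    scale = max(abs(w) for w in block) / 127 or 1.0  # guard all-zero blocks
    quants = [round(w / scale) for w in block]       # int8 payload
    return scale, quants

def dequant(scale, quants):
    return [q * scale for q in quants]

random.seed(0)
weights = [random.uniform(-1, 1) for _ in range(BLOCK)]
scale, quants = q8_0_block(weights)
restored = dequant(scale, quants)
print("max error:", max(abs(a - b) for a, b in zip(weights, restored)))
```

K-Quants refine this with nested "scales of scales" and, as the article notes, different bit widths per layer; the per-block scale is also why GGUF files are slightly larger than their nominal bit width suggests.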

BF16 / FP16 (The Gold Standard)

  • Quality: ⭐⭐⭐⭐⭐
    No loss in motion consistency or fine texture.

  • Speed: Baseline. Requires high-end hardware (e.g., 48GB+ VRAM for the 14B model).

  • LoRA Compatibility: Native/Perfect. Most Wan 2.2 LoRAs are trained and tested on this precision.

FP8 (Scaled / Mixed)

  • Quality: ⭐⭐⭐⭐✬
    Nearly indistinguishable from BF16, though minor "flicker" can occur in complex textures.

  • Speed: Fastest on modern NVIDIA GPUs (40-series/H100) due to native FP8 hardware acceleration.

  • LoRA Compatibility: Excellent. Most modern workflows (ComfyUI/Diffusers) support applying LoRAs directly to FP8 weights with minimal shift.

GGUF Q8_0 / Q6_K

  • Quality: ⭐⭐⭐⭐
    Extremely high retention of the MoE (Mixture of Experts) logic in Wan 2.2.

  • Speed: Moderate. Slower than FP8 on high-end GPUs, but highly efficient for CPU/System RAM offloading.

  • LoRA Compatibility: Good. Requires specific loaders (like ComfyUI-GGUF) to patch LoRAs into the quantized weights.

GGUF Q5_K_M / Q5_0

  • Quality: ⭐⭐⭐⭐

  • Visuals: High retention of "micro-movements" (eye blinks, finger articulation) that often turn jittery in 4-bit. It holds the MoE expert routing stability nearly as well as Q6.

  • Speed: Balanced. Significant VRAM savings over FP8/Q8, allowing the 14B model to run comfortably on 8GB cards with headroom left for higher resolutions.

  • LoRA Compatibility: Good. Unlike 4-bit, Q5 has enough "headroom" to maintain the likeness of specific faces or textures from a LoRA without the "smearing" effect.

NF4 (4-bit NormalFloat)

  • Quality: ⭐⭐⭐✬
    Good for general composition, but you may notice loss in "cinematic" fine details and text rendering.

  • Speed: Very Fast. Great for mid-range cards (e.g., RTX 3060/4060) to avoid OOM (Out of Memory) errors.

  • LoRA Compatibility: Fair. LoRA "smearing" can occur where the adapter's effect feels less precise or overly aggressive.

GGUF Q4_K_M / NVFP4

  • Quality: ⭐⭐⭐
    Significant compression. Motion may become slightly more "robotic" or jittery in Wan 2.2's 14B experts.

  • Speed: High Efficiency. NVFP4 is specifically tuned for speed on Blackwell/Ada architectures.

  • LoRA Compatibility: Moderate. Fine-tuned details from LoRAs (like specific faces) may lose likeness.

GGUF Q3_K_M / Q3_K_L

  • Quality: ⭐⭐✬
    Dynamic motion remains, but fine textures (hair, skin) become "mushy." Prompt adherence begins to slip.

  • Speed: Fast. Allows the 14B model to run on 8GB-10GB VRAM cards.

  • LoRA Compatibility: Weak. LoRAs may fail to "trigger" properly, or may cause significant color shifts and artifacts.

GGUF Q2_K / IQ2_XS

  • Quality:
    Significant "hallucination" in video frames; Wan 2.2 may struggle to keep the MoE experts synchronized.

  • Speed: Very Fast (but low utility).

  • LoRA Compatibility: Poor. Most LoRAs will fail to produce recognizable results at this level of degradation.
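A back-of-envelope way to compare the formats above is weight-only memory for a 14B-parameter model. The effective bits-per-weight figures below are my assumptions, not official numbers (K-Quants sit above their nominal bit width because of per-block scales), and activations, latents, and the text encoder all come on top:

```python
# Rough weight-only footprint of a 14B model. The bits-per-weight values are
# assumed approximations (incl. block-scale overhead); treat as ballpark only.
PARAMS = 14e9
bits_per_weight = {
    "BF16/FP16": 16.0,
    "FP8":        8.0,
    "Q8_0":       8.5,
    "Q5_K_M":     5.5,
    "Q4_K_M":     4.85,
    "NF4/NVFP4":  4.5,
    "Q3_K_M":     3.9,
    "Q2_K":       2.6,
}
for name, bpw in bits_per_weight.items():
    gib = PARAMS * bpw / 8 / 2**30
    print(f"{name:10s} ~{gib:5.1f} GiB")
```

BF16 weights alone land around 26 GiB, which is why the "Gold Standard" row above calls for 48GB+ cards once activations are included, while the Q3/Q4 rows fit the 8GB-10GB range quoted for those quants.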

Examples

~ (FP8+ vs NVFP4)
soon...

SynthSeduction v9

FP8+ vs GGUF vs Basic

TastySin v8

GGUF Q8-Q2
