Sign In

HUAWEI PanGu Draw SDXL V3

6
153
1
Updated: Feb 26, 2024
Verified:
SafeTensor
Type
Checkpoint Trained
Stats
111
Reviews
Published
Feb 24, 2024
Base Model
SDXL 1.0
Trigger Words
HUAWEI
Hash
AutoV2
0AA83AE1B9

What is PanGu Draw SDXL V3?

“PanGu-Draw” is a novel latent diffusion model for text-to-image synthesis that adeptly accommodates multiple control signals: a diffusion-based generative model trained by HUAWEI.(learn more)

User Guide

  1. This model and its running program are licensed under Apache 2.0. Please use this model and program in compliance with the license.

  2. This model consists of two sub-models, one is the high_timestep model and the other is the low_timestep model. The running program needs to use both models simultaneously.

  3. The running program of this model is still under development. It can only implement the text-to-image function for now. Please wait for further development if you need more functions.

  4. This model has 5B parameters, and the running program is poorly optimized. Please make sure you have at least 20G of VRAM to run this program.

  5. Comfyui and webui do not support this model,So I created a comfyui node to run this model(here)

PanGu Draw 3.0 PanGu Draw 3.0 models implemented with MindSpore.I migrated it from MindSpore to Torch and implemented it to run on ComfyUI.

Original Github rope

comfyui nodes

Features

In contrast to version 2.0, Pangu Draw 3.0 has been subject to experimentation and updates across various aspects, including multi-language support, diverse resolutions, improved image quality, and model scaling. This includes:

  • The current industry's largest 5-billion-parameter Chinese text-to-image model.

  • Supports bilingual input in both Chinese and English.

  • Supports output of native 1K resolution images.

  • Outputs images in multiple size ratios.

  • Quantifiable stylized adjustments: cartoon, aesthetic, photography controller.

  • Based on Ascend+MindSpore for large-scale training and inference, using a self-developed MindSpore platform and Ascend 910 hardware.

  • Utilizes self-developed RLAIF to enhance image quality and artistic expression.

Author's Note:

This comfyui plugin is a program that I developed as a side project while learning stable diffusion. I am not a professional, and the program may have bugs. I apologize for my lack of technical skills.