
HUAWEI PanGu Draw SDXL V3

Name: HUAWEI PanGu Draw SDXL V3
Rating: 5 (7 reviews)
Author: www913363043337


Updated: Feb 26, 2024

Download (6.45 GB)

Verified: 2 years ago

SafeTensor

Details

Type	Checkpoint Trained
Stats	140 0
Reviews	Positive (6)
Published	Feb 24, 2024
Base Model	SDXL 1.0
Trigger Words	HUAWEI
Hash	AutoV2 0AA83AE1B9


License:

CreativeML Open RAIL++-M Addendum

What is PanGu Draw SDXL V3?

PanGu-Draw is a latent diffusion model for text-to-image synthesis, trained by HUAWEI, that accommodates multiple control signals. (learn more)

User Guide

This model and its running program are licensed under Apache 2.0. Please use this model and program in compliance with the license.
This model consists of two sub-models: the high_timestep model and the low_timestep model. The running program needs both models at the same time.
The running program of this model is still under development. For now it supports only text-to-image. Please wait for further development if you need more functions.
This model has 5B parameters, and the running program is poorly optimized. Please make sure you have at least 20 GB of VRAM to run this program.
ComfyUI and WebUI do not support this model, so I created a ComfyUI node to run it (here).
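
As a rough illustration of why both sub-models are needed at once, here is a minimal Python sketch of routing each denoising step to one of two sub-models by timestep. It is not the actual node code: the function names, the denoiser interface, and the split point of 500 are all assumptions made up for this example, and the real program may divide the schedule differently.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical interface: a denoiser takes (latent, timestep) and returns a new latent.
Denoiser = Callable[[List[float], int], List[float]]


def select_denoiser(t: int, split_timestep: int, high: Denoiser, low: Denoiser) -> Denoiser:
    """Pick the sub-model for timestep t: `high` at or above the split, `low` below it."""
    return high if t >= split_timestep else low


def run_schedule(
    latent: List[float],
    timesteps: Sequence[int],
    split_timestep: int,
    high: Denoiser,
    low: Denoiser,
) -> List[float]:
    """Walk the timesteps in order, sending each step to the matching sub-model."""
    for t in timesteps:
        model = select_denoiser(t, split_timestep, high, low)
        latent = model(latent, t)
    return latent


# Stand-in sub-models that only record which one was called.
calls: List[Tuple[str, int]] = []


def fake_high(latent: List[float], t: int) -> List[float]:
    calls.append(("high", t))
    return latent


def fake_low(latent: List[float], t: int) -> List[float]:
    calls.append(("low", t))
    return latent


# Assumed split point of 500, chosen only for the example.
result = run_schedule([0.0], [900, 600, 300, 0], 500, fake_high, fake_low)
print(calls)
```

With the stand-ins above, the first two steps go to the high_timestep stub and the last two to the low_timestep stub. Because a single sampling run crosses the split, both sets of weights have to be available during generation.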

PanGu Draw 3.0: the original PanGu Draw 3.0 models are implemented with MindSpore. I migrated them from MindSpore to Torch and made them run on ComfyUI.

Original GitHub repo

ComfyUI nodes

Features

Compared with version 2.0, PanGu Draw 3.0 brings experiments and updates in several areas, including multi-language support, diverse resolutions, improved image quality, and model scaling. Specifically:

Currently the industry's largest Chinese text-to-image model, with 5 billion parameters.
Supports bilingual input in both Chinese and English.
Supports output of native 1K resolution images.
Outputs images in multiple size ratios.
Quantifiable style adjustments through cartoon, aesthetic, and photography controllers.
Large-scale training and inference on Ascend+MindSpore, using the self-developed MindSpore platform and Ascend 910 hardware.
Utilizes self-developed RLAIF to enhance image quality and artistic expression.

Author's Note:

This ComfyUI plugin is a program that I developed as a side project while learning Stable Diffusion. I am not a professional, and the program may have bugs. I apologize for my lack of technical skills.