home models images videos posts articles comics challenges events updates shop

AuraFlow VAE

Name: AuraFlow VAE
Rating: 5 (22 reviews)
Author: LOL2024

Updated: Aug 28, 2024

base model

general use vae basemodel base models

Download

1 variant available

SafeTensor

159.58 MB

Verified: 2 years ago

Download (159.58 MB)

Details

Type

VAE

Stats

338

Reviews

Positive

(7)

Published

Aug 22, 2024

Base Model

AuraFlow

Hash

AutoV2

BCB60880A4

Recommended Resources

LOL2024

AuraFlow v0.1 is the fully open-sourced flow-based text-to-image generation model.

AuraFlow v0.2 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version.

AuraFlow v0.3 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version.

This model achieves state-of-the-art results on GenEval. Read our blog post for more technical details. You can also check out the comparison with other models on this gallery page.

The model is currently in beta. We are working on improving it and the community's feedback is important. Join fal's Discord to give us feedback and stay in touch with the model development.

Credits: A huge thank you to @cloneofsimo and @isidentical for bringing this project to life. It's incredible what two cracked engineers can achieve in such a short period of time. We also extend our gratitude to the incredible researchers whose prior work laid the foundation for our efforts.

Usage (v0.1)

$ pip install transformers accelerate protobuf sentencepiece
$ pip install git+https://github.com/huggingface/diffusers.git

from diffusers import AuraFlowPipeline
import torch

pipeline = AuraFlowPipeline.from_pretrained(
    "fal/AuraFlow",
    torch_dtype=torch.float16
).to("cuda")

image = pipeline(
    prompt="close-up portrait of a majestic iguana with vibrant blue-green scales, piercing amber eyes, and orange spiky crest. Intricate textures and details visible on scaly skin. Wrapped in dark hood, giving regal appearance. Dramatic lighting against black background. Hyper-realistic, high-resolution image showcasing the reptile's expressive features and coloration.",
    height=1024,
    width=1024,
    num_inference_steps=50, 
    generator=torch.Generator().manual_seed(666),
    guidance_scale=3.5,
).images[0]

Usage (v0.2)

$ pip install transformers accelerate protobuf sentencepiece
$ pip install git+https://github.com/huggingface/diffusers.git

from diffusers import AuraFlowPipeline
import torch

pipeline = AuraFlowPipeline.from_pretrained(
    "fal/AuraFlow-v0.2",
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

image = pipeline(
    prompt="close-up portrait of a majestic iguana with vibrant blue-green scales, piercing amber eyes, and orange spiky crest. Intricate textures and details visible on scaly skin. Wrapped in dark hood, giving regal appearance. Dramatic lighting against black background. Hyper-realistic, high-resolution image showcasing the reptile's expressive features and coloration.",
    height=1024,
    width=1024,
    num_inference_steps=50, 
    generator=torch.Generator().manual_seed(666),
    guidance_scale=3.5,
).images[0]

image.save("output.png")