Rythmind / Loïc Barcourt (Beatboxer) - Flux.1 D

Type: LoRA
Stats: 26
Reviews:
Published: Nov 8, 2024
Base Model: Flux.1 D
Training: 3,535 steps, 1 epoch
Usage Tips: Clip Skip 1, Strength 1
Trigger Words: Loïc Barcourt
Hash (AutoV2): 86A6CCA91E
diogod
The FLUX.1 [dev] Model is licensed by Black Forest Labs. Inc. under the FLUX.1 [dev] Non-Commercial License. Copyright Black Forest Labs. Inc.
IN NO EVENT SHALL BLACK FOREST LABS, INC. BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.

Loïc Barcourt (born 19 September 1988), better known as Rythmind, is a French beatboxer and looper. He is a current member of Berywam. He makes short videos for YouTube and TikTok built around the phrase “Can you remake this with your mouth?”.

Please be responsible: this is based on the likeness of a real person, so follow the Civitai rules when posting. But please do test it! I’ll be glad to see some results.

V1.5 is out. Read the "about this model" section in the right panel.

There's no need for a negative prompt now.

You can check some comparisons here: https://civitai.com/posts/8384839


V1 info (outdated):

This NEEDS a negative prompt with at least "worst quality, jpeg artifacts, screencap,..."

That is because it learned the bad quality of the many screencaps I used all too well: I didn't increase the repeats for the high-quality images to compensate. (The Ostris tool doesn't have the per-folder repeat setting that Kohya has.)
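The repeat imbalance described above can be illustrated with a small sketch. This is not what was done for this LoRA; it only shows the idea of weighting high-quality images more heavily when a trainer has no per-folder repeat setting (Kohya-style trainers read the repeat count from a folder name such as `4_highres`). The file names, quality labels, and repeat counts below are all hypothetical.

```python
from collections import Counter


def expand_with_repeats(images, repeats_by_quality):
    """Return a training list in which each image path appears `repeats` times.

    `images` maps a file path to a quality label, and `repeats_by_quality`
    maps that label to a repeat count. Both are hypothetical inputs.
    """
    expanded = []
    for path, quality in sorted(images.items()):
        expanded.extend([path] * repeats_by_quality[quality])
    return expanded


# Hypothetical dataset: two screencaps and one high-resolution photo.
images = {
    "screencap_01.jpg": "low",
    "screencap_02.jpg": "low",
    "photo_01.jpg": "high",
}

# Assumed weights: each high-quality image is seen four times per epoch.
repeats = {"low": 1, "high": 4}

training_list = expand_with_repeats(images, repeats)
counts = Counter(training_list)
print(counts)
```

With these assumed weights the single good photo is seen as often as four screencaps, which is the kind of rebalancing the folder repeats would have provided.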

To use negatives on Forge, enable "DynamicThresholding (CFG-Fix) Integrated" and use a CFG of 3.5 or more. In its settings:

  • Mimic Scale: 1
  • Threshold Percentile: 0.99
  • Mimic Mode: half cosine up
  • Mimic Scale Min: 0
  • Cfg Scale Min: 0
  • Sched Val: 0
  • Interpolate Phi: 0.75

These are just my settings, not necessarily the best ones.

Trigger Keyword:

Loïc Barcourt

Supporting prompts:

HD high professional quality photo, best quality

Negative supporting prompts:

worst quality, jpeg artifacts, screencap, (blurry:1.2)
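As a rough illustration, the trigger keyword and the supporting prompts above could be combined like this. The scene text is a made-up example, and putting the trigger word first is an assumption rather than a tested requirement. The `(blurry:1.2)` syntax is the attention weighting used by A1111-style UIs such as Forge.

```python
TRIGGER = "Loïc Barcourt"
SUPPORT = "HD high professional quality photo, best quality"
NEGATIVE = "worst quality, jpeg artifacts, screencap, (blurry:1.2)"


def build_prompts(scene):
    """Return (positive, negative) prompt strings for one generation.

    `scene` is free text describing the image; the ordering of the
    parts is an assumption, not something verified for this LoRA.
    """
    positive = ", ".join([TRIGGER, scene, SUPPORT])
    return positive, NEGATIVE


# Hypothetical scene description.
positive, negative = build_prompts("portrait on a stage")
print(positive)
print(negative)
```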

Flux got the blue eyes pretty well. The quality of the pupils sometimes suffers (as does the overall image quality).

I'm not super satisfied with this LoRA. The resemblance is close but not perfect, and the way it learned the bad quality kind of sucks. I'll probably have to do another version.

 

Same as for the SDXL version:

The dataset is problematic because only a few images are truly high-quality, high-resolution images. Many are low-res and many are screencaps.

I also experimented with a different idea to try to make it more flexible. I triplicated the dataset and captioned each copy with a different natural-description captioner: CogVL, Florence fine-tuned, and Florence base full. The first tag was his name plus class, “Loïc Barcourt Person”. The second tag was the natural description. After that, I added WD14 tags. I pruned all of it a little.
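The caption layout described above (name plus class, then the natural description, then WD14 tags) can be sketched as follows. This is only an illustration of the ordering, not the actual script used: the captioner keys, the description texts, and the WD14 tags are invented placeholders, and the pruning step is left out.

```python
def build_caption(name, class_word, description, wd14_tags):
    """Join the caption parts in the order: name + class, description, tags."""
    parts = [f"{name} {class_word}", description.strip()]
    parts.extend(tag.strip() for tag in wd14_tags if tag.strip())
    return ", ".join(parts)


def captions_per_captioner(descriptions, wd14_tags):
    """Build one caption per captioner, i.e. one per copy of the dataset."""
    return {
        captioner: build_caption("Loïc Barcourt", "Person", text, wd14_tags)
        for captioner, text in descriptions.items()
    }


# Placeholder outputs standing in for the three captioners.
descriptions = {
    "cogvl": "A man with blue eyes holds a microphone close to his mouth.",
    "florence_finetuned": "A photo of a man with blue eyes on a stage.",
    "florence_base": "A man holding a microphone.",
}
# Placeholder WD14 tags for the same image.
wd14_tags = ["1boy", "solo", "blue eyes", "microphone"]

captions = captions_per_captioner(descriptions, wd14_tags)
for captioner, caption in captions.items():
    print(f"{captioner}: {caption}")
```

Each image therefore ends up with three captions that share the same first tag and WD14 tags but differ in the natural description.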

 

His mole/skin detail below his left eyebrow is pretty much impossible to generate without inpainting or some other intervention. I tried creating a folder focused specifically on it, but the zoom degraded the quality too much, so I could not use many repetitions.

The base checkpoint is the original Flux dev fp16 one, so it should be very flexible with any checkpoint.

 

For standard generation:

USE negatives, so CFG > 1

Euler + simple

50-60 steps is way better for photos on Flux

Problems with the current LoRA:

  • It will decrease overall image quality to something like a screenshot. This sucks. Use negatives.

  • Resemblance is not always great

  • The mole below his left eye won’t be generated

  • More testing is needed to see what else....

  

I wish I could do Block Weights on Flux...

Hopefully you can leave some results and comments. Any ideas are appreciated. Thank you.