home models images videos 3D Models articles comics challenges updates shop

Head POV - Point of view from the back of the head - Camera over the shoulders - Animal Perspective

Name: Head POV - Point of view from the back of the head - Camera over the shoulders - Animal Perspective
Rating: 5 (107 reviews)
Author: diogod

107

835

5.4k

Updated: Apr 24, 2025

concept

pov view from behind behind the head perspective over the shoulders

Download

1 variant available

SafeTensor

109.14 MB

Verified: 2 years ago

Download (109.14 MB)

Optional Files

Details

Type

LoRA

Stats

835

5.4K

1.2K

Reviews

Very Positive

(107)

Published

Mar 1, 2024

Base Model

SDXL 1.0

Training

Steps: 1,098

Epochs: 18

Usage Tips

Clip Skip: 1

Strength: 0.8

Hash

AutoV2

D325A3F613

Trigger Words

shot in the point of view from the back of a

Tensors

About this version

default creator card background decoration

1.2K

2.8K

diogod

Joined Jan 3, 2023

License:

CreativeML Open RAIL++-M Addendum

44642-54764609-,natural lighting, 4k, high quality, Fujifilm XT3 ADDCOMM _a animal shooting laser beans on a man from his back running away loo.jpg

A simple concept that I could not do it right with SDXL

Recommended weight is 0.85‎ / Good from 0.6 up to 1.3

‎

It generalizes quite well for humans and even objects. Try it out.

Trigger Keyword:

a photo shot in the point of view from the back of a SUBJECT's head

Supporting prompts:

on the lower side, cropped, looking at___, ears, bokeh, dof, blur

Negative supporting prompts:

mouth, nose, eyes, facing the camera, bokeh, dof, blur

‎

The dataset was not big and had only animals and one or two bike. So, some animals will be hard to turn like snakes, ostrich, pigs, turtles...

I choose epoch 18, but for some subjects a more trained epoch worked better and manage to turn them better. But introduced more errors, so IMO this one is the best. I might upload a more trained epoch if anyone wants.

Just and example, with this epoch Pikachu red cheeks will always look wrong. Epoch 24 and epoch 40 turned him super well. A mouse ear will also look like it's facing the wrong direction while on epoch 40 it looks correct.

This is a "POV", "over the shoulder shot", but I did not use those exact words on the training, I used "point of view". So I don't know if they help or not.

They migh occupy the whole screen, if you want them only on the lower side I suggest to use Regional Prompter. I works super great. Also if you want to use it with other character Loras you should also use regional prompter or else they will morph.

I hope in the future to increase the dataset and caption the position (right side, left side, bottom, upper side). But right now it is not, so it won't work.

‎Other parameters and settings:

‎

The base checkpoint is the “sdXL_v10VAEFix” 6.7GB. So, it should be very flexible with any checkpoint.

As of right now, I recommend juggernautXL_v8Rundiffusion and juggerxlInpaint_juggerInpaintV8 for inpaiting.

‎

Lighting models works great! I recommend Dreamshaper SDXL

I prefer 6 steps with DPM++ 2S a Karras CFG 2.2 and high-res for 5 steps 0.45 denois and 1.5x res. But the default is DPM++ SDE Karras, CFG 2, 4 steps.

The new Juggernaut lighting is probably excellent too.

For standard generation

CFG: 5.5

DPM++ 3M Exponential (50 steps or more)

DPM++ 2M Karras (25 Steps or more)

DPM++ SDE Karras

DPM++ 2S a Karras

‎‎

Loractl works great if you want to have a more complex prompts, subjects or other Loras, start high and loose later. Like this:

<LoraName:[email protected],[email protected]>

‎

Want to have some “fun”? Install wildcards dynamic prompts extension https://github.com/adieyal/sd-dynamic-prompts and my common_animals.txt to \extensions\sd-dynamic-prompts\wildcards: Here is a prompt I made for testing. Paste on prompt:

a photo shot in the point of view from the back of a __common_animals__'s head close-up, on __YetAnotherWildcardCollection-main/Background/Environment__<lora:HeadPOV_from_behind_vk1-000018:0.85>

‎

Problems with the current Lora:

Might not turn a bunch of animals, needs more data
Sometimes double horns, weird ears and eyes, ears facing the camera

Some more settings: Trained 1024 res. 61 images captioned with the help of CogVL and taggui-v1.15.0-windows. Epoch 18 of 44. now prodigy 1.0. 2 steps folder "Pose" as the concept. constant BATCH 2, rank 16/1,Scale weight norms 1, snr gamma 5, Noise offset 0.0357, no regularization image

Hopefully you can leave some results and some comments. Any idea is appreciated. Thank you.