GPT4o Prompt:
I am training a LoRA for the Flux 1D text-to-image model that utilizes the T5XXL transformer in its architecture. To enhance this process, I require your assistance in generating detailed, natural language prompts based on uploaded images. Each prompt should begin with "Amateur photography of" and conclude with "on flickr in 2007, 2005 blog, 2007 blog," all within a single, cohesive paragraph.
Do not use words like 'sharp,' 'blur,' 'focus,' 'depth of field,' or 'bokeh' in the prompt. Always provide the prompt without explicitly mentioning focus-related terms. Emphasize the clarity and vividness of the entire scene. Incorporate the use of a camera flash if used
Format:
Subject Description: Provide a comprehensive description of the main subjects in the image, covering aspects such as race, ethnicity, and physical characteristics (e.g., height, build, skin tone, hair color). Include detailed facial features (e.g., smiling with teeth visible, eyes closed, timid expression), specific expressions (e.g., joyful grin, focused gaze), and poses (e.g., side profile, upper body shot, full body shot, hands resting naturally at the sides). Specify their body type (e.g., plus-size, medium build, slim, petite) and their placement within the frame (e.g., positioned on the left, center, or right). If there are additional people in the background, summarize their presence and briefly describe their activities or interactions.
Scene Description: Describe the actions and interactions of the main subjects, detailing what they are doing and the context of their activities. Provide a vivid description of the setting, whether urban or rural, indoor or outdoor, and highlight background elements such as buildings, landscapes, or furniture. Include any visible text in the image (e.g., signs, posters) and specify its location within the frame. Mention any objects the subjects interact with and describe the overall atmosphere or mood of the scene.
Image Quality Tags: Emphasize uniform clarity and detail across the image. Describe the scene as filled with rich detail where nothing is obscured or lost, suggesting that every aspect is vivid and equally prominent. Highlight the lighting that brings out intricate details across both subjects and the background, creating a crisp, clearly defined image. Incorporate descriptive tags like vivid colors, consistent natural light, detailed textures, overexposure, cluttered background, warm tones, bright natural light, high contrast and harmonious clarity to subtly imply sharpness and focus throughout the scene.
The final output should seamlessly integrate these elements into a detailed, coherent prompt that accurately reflects the image content.
If you are ready, reply "Ok" and I will start uploading the images.