LoraTag — AI Image Captioning for LoRA Training
LoraTag is an AI-powered image captioning tool built for LoRA training datasets. It uses GPT-4 Vision to generate natural language captions for image datasets used to train LoRA models for Stable Diffusion, FLUX, and SDXL. Upload your images, choose a detail level, and download training-ready .txt caption files in minutes. Free tier: 50 images/month, no credit card required.
How LoraTag Works
LoraTag replaces hours of manual image captioning with AI-powered batch processing. The workflow is simple:
- Upload images — Drag and drop your training images or select a folder. Supports JPG, PNG, WEBP, and most common image formats.
- Choose detail level — Select brief (10-20 words), standard (30-50 words), or detailed (80-150 words) captions depending on your training needs.
- Generate captions — GPT-4 Vision analyzes each image and writes natural language descriptions covering composition, subjects, style, colors, lighting, and context.
- Download .txt files — Each image gets a matching .txt caption file, ready for kohya_ss, EveryDream, SimpleTuner, or any LoRA training tool.
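That one-to-one pairing (image.jpg next to image.txt, sharing the same basename) is the convention kohya_ss and most other trainers expect. As a quick local sanity check, here is a minimal Python sketch (the dataset folder name is hypothetical) that flags any image still missing its caption file:

```python
# Sketch: flag images that have no matching .txt caption file.
# Assumes the standard basename pairing (img_001.jpg -> img_001.txt)
# used by kohya_ss and similar trainers; the folder name is hypothetical.
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}

def find_uncaptioned(dataset_dir: str) -> list[Path]:
    """Return image files with no sibling .txt caption."""
    return [
        img for img in sorted(Path(dataset_dir).iterdir())
        if img.suffix.lower() in IMAGE_EXTS
        and not img.with_suffix(".txt").exists()
    ]

for img in find_uncaptioned("my_lora_dataset"):
    print(f"missing caption: {img.name}")
```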
Key Features
- GPT-4 Vision captions — Natural language descriptions that understand context, composition, and artistic style, not just tags
- Batch processing — Caption hundreds of images in minutes instead of hours of manual work
- Three detail levels — Brief (trigger words), standard (training descriptions), or detailed (comprehensive scene analysis)
- Standard .txt output — Compatible with kohya_ss, EveryDream, SimpleTuner, ai-toolkit, and other popular LoRA trainers
- Directory support — Organize images in folders; LoraTag preserves your directory structure
- Custom prompts — Add custom instructions to guide the captioning style for your specific use case
- Token counting — See estimated token counts for each caption to stay within CLIP limits (see the sketch after this list)
- Edit before download — Review and refine captions in-browser before exporting
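On the CLIP limit behind the token-counting feature: Stable Diffusion's CLIP text encoders accept at most 77 tokens per caption (including start/end tokens), so very long detailed captions can be truncated by some trainers. A minimal sketch, assuming downloaded caption files and the Hugging Face CLIP tokenizer, for double-checking counts offline:

```python
# Sketch: count CLIP tokens per caption to spot ones that would be truncated
# at the 77-token context limit of Stable Diffusion's text encoder.
# The folder name is hypothetical; requires the `transformers` package.
from pathlib import Path
from transformers import CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")

for caption_file in Path("my_lora_dataset").glob("*.txt"):
    text = caption_file.read_text().strip()
    n_tokens = len(tokenizer(text)["input_ids"])   # includes BOS/EOS tokens
    if n_tokens > tokenizer.model_max_length:      # 77 for CLIP
        print(f"{caption_file.name}: {n_tokens} tokens (would be truncated)")
```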
LoraTag vs WD14 Tagger vs Manual Captioning
Most LoRA creators manually caption their datasets or use WD14 Tagger for booru-style tags. Here is how LoraTag compares:
- LoraTag: Natural language captions from GPT-4 Vision. Understands context, composition, and artistic style. 10-150 words per image, depending on detail level. Best caption quality for LoRA training. Cost: free to $29/month.
- WD14 Tagger: Booru-style comma-separated tags. Fast and free, but limited to a fixed tag vocabulary. No understanding of composition or context. Good for anime/illustration datasets.
- Manual captioning: Highest quality but extremely time-consuming. 2-5 minutes per image. Not scalable for datasets over 50 images.
- BLIP/BLIP-2: Free open-source alternative. Shorter, less detailed captions than GPT-4 Vision. Good for basic descriptions but misses nuance.
LoraTag gives you the quality of manual captioning at the speed of automated tagging. Users report 80-90% less time spent on dataset preparation compared to manual captioning.
Who Uses LoraTag
LoraTag is used by AI artists, LoRA creators, fine-tuning researchers, and anyone training custom Stable Diffusion or FLUX models. Common use cases include character LoRAs (consistent character training), style LoRAs (artistic style transfer), concept LoRAs (teaching new concepts), product photography datasets, and architectural visualization training sets.
Supported Models and Trainers
LoraTag generates captions compatible with all major LoRA training tools: kohya_ss (sd-scripts), EveryDream 2.0, SimpleTuner, ai-toolkit, OneTrainer, and Dreambooth extensions. Captions work with Stable Diffusion 1.5, SDXL, Stable Diffusion 3, and FLUX model architectures.
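For a concrete example of where the caption files go: kohya_ss (sd-scripts), in its DreamBooth-style layout, reads image/caption pairs from folders whose names encode a repeat count. A typical structure (names are illustrative) looks like:

```
train_data/
  10_mychar/           # "10" = repeats per epoch, "mychar" = instance name
    img_001.png
    img_001.txt        # caption file generated by LoraTag
    img_002.png
    img_002.txt
```

Other trainers such as EveryDream and SimpleTuner use the same sidecar-.txt convention with their own folder configuration.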
Pricing
- Free: 50 images/month, no credit card required, all features included
- Pro ($9/month): 500 images/month, priority processing, faster queue
- Unlimited ($29/month): Unlimited images, API access, bulk download, priority support
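The Unlimited tier's API access makes it possible to script captioning into a larger pipeline. The sketch below is illustrative only: the endpoint URL, field names, and response shape are assumptions, not LoraTag's documented API; consult the actual API reference for the real interface.

```python
# Hypothetical sketch only: the endpoint, auth header, and field names are
# assumptions for illustration, not LoraTag's documented API.
import requests

API_URL = "https://loratag.example/api/v1/caption"  # placeholder URL
HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}  # placeholder auth

with open("img_001.png", "rb") as f:
    resp = requests.post(
        API_URL,
        headers=HEADERS,
        files={"image": f},
        data={"detail": "standard"},  # brief | standard | detailed
        timeout=60,
    )
resp.raise_for_status()
print(resp.json())  # assumed shape, e.g. {"caption": "..."}
```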
Getting Started with LoraTag
Setting up LoraTag takes under a minute. Create a free account, upload your training images, choose a detail level (brief, standard, or detailed), and click Generate. LoraTag processes your entire dataset in parallel and outputs one .txt caption file per image — ready to drop into your training folder.
For best results, use images that are already cropped and cleaned for training. LoraTag analyzes each image independently, so consistent subject framing produces more consistent captions. You can regenerate individual captions, edit them inline, or download the full set as a ZIP archive.
LoraTag works entirely in the browser — no software to install, no Python dependencies, no GPU required. Upload from any device, caption your dataset, and download the results. Your training workflow stays simple and fast.
Frequently Asked Questions
What is LoraTag?
LoraTag is a web-based AI captioning tool specifically designed for LoRA training datasets. It uses GPT-4 Vision to generate natural language descriptions of images, producing training-ready .txt caption files that work with kohya_ss, EveryDream, and other popular LoRA trainers. It replaces hours of manual captioning work.
How is LoraTag different from WD14 Tagger?
WD14 Tagger generates booru-style comma-separated tags (e.g., "1girl, blue_hair, standing"). LoraTag uses GPT-4 Vision to write natural language captions that understand context, composition, lighting, and artistic style (e.g., "A young woman with blue hair standing in a sunlit garden, soft bokeh background, warm afternoon lighting"). Natural language captions generally produce better LoRA training results because they capture relationships between elements.
Is LoraTag free to use?
Yes. The free tier includes 50 images per month with all features — no credit card required, no watermarks, no limitations on detail level. Pro ($9/month) adds 500 images and priority processing. Unlimited ($29/month) removes all limits and adds API access.
What image formats does LoraTag support?
LoraTag supports JPG, JPEG, PNG, WEBP, GIF, BMP, and TIFF, which covers the formats most commonly used in LoRA training datasets.
Can I use LoraTag for FLUX model training?
Yes. LoraTag generates captions compatible with FLUX, Stable Diffusion 1.5, SDXL, and SD3 model architectures. The .txt output format works with all major LoRA training tools regardless of the target model.
How long does captioning take?
A typical dataset of 100 images takes 3-5 minutes to caption, depending on detail level and server load. Brief captions are faster; detailed captions take slightly longer per image. Processing happens in parallel for maximum speed.
Are my images stored or shared?
Images are processed in memory and not permanently stored on our servers. They are sent to OpenAI's GPT-4 Vision API for analysis and then deleted. We do not use your images for training or share them with third parties.