Image-to-Image
After Text-to-Image, Image-to-Image is the most common use case for diffusion models in Automatic 1111. With Auto 1111 SDK, you can generate images from a text prompt and a reference image as follows:
Load a .safetensors weights file or a checkpoint (.ckpt) file and initialize a StableDiffusionPipeline. The same initialized pipeline can be reused for Text-to-Image or any of our other inference pipelines.
Load the input image.
Pass the prompt and the input image to the pipeline to generate an image.
The output is a list of PIL Image objects. By default, the pipeline generates one image; you can change this through the pipeline's parameters.
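The steps above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the SDK's reference usage: the import path `auto1111sdk`, the method name `generate_img2img`, and the keyword names `init_image`, `prompt`, and `num_images` are assumptions to check against your installed version. The helper names (`build_pipeline`, `load_init_image`, `run_img2img`) and the output file naming are hypothetical and exist only for illustration.

```python
from pathlib import Path


def build_pipeline(weights_path):
    """Initialize the pipeline from a .safetensors or .ckpt file."""
    # Assumption: the SDK exposes StableDiffusionPipeline under this import path.
    from auto1111sdk import StableDiffusionPipeline
    return StableDiffusionPipeline(weights_path)


def load_init_image(image_path):
    """Open the reference image with Pillow and normalize it to RGB."""
    # Imported lazily so the other helpers do not require Pillow.
    from PIL import Image
    return Image.open(image_path).convert("RGB")


def run_img2img(pipe, init_image, prompt, out_dir="outputs", **params):
    """Generate images from a prompt plus a reference image and save them as PNGs.

    Extra keyword arguments (for example num_images=2) are forwarded unchanged.
    """
    # Assumption: the img2img entry point and its keyword names are as shown here.
    images = pipe.generate_img2img(init_image=init_image, prompt=prompt, **params)

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    saved = []
    # The pipeline returns a list of PIL Image objects; save each one.
    for index, image in enumerate(images):
        target = out / f"img2img_{index}.png"
        image.save(target)
        saved.append(target)
    return saved
```

A typical call would build the pipeline once with `build_pipeline("<path to your weights file>")`, load the reference image with `load_init_image("<path to image>")`, and then call `run_img2img(pipe, init_image, "a picture of a brown dog", num_images=2)` to request two images instead of the default one.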
Parameters
Auto 1111 SDK currently supports the following parameters, listed with their default values:
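Defaults can differ between SDK versions, so it may help to confirm them against the version you have installed. One option is Python's standard `inspect` module, sketched below. The helper name `describe_parameters` is hypothetical, and passing it `pipe.generate_img2img` assumes that is the name of the Image-to-Image method in your installation.

```python
import inspect


def describe_parameters(func):
    """Return a {parameter name: default value} mapping for a callable.

    Parameters without a default are reported as the string "<required>".
    """
    described = {}
    for name, parameter in inspect.signature(func).parameters.items():
        if parameter.default is inspect.Parameter.empty:
            described[name] = "<required>"
        else:
            described[name] = parameter.default
    return described
```

For a bound method such as `pipe.generate_img2img` (assumed name), `self` is not included in the result, so the mapping contains only the arguments a caller can pass.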