Hacklahoma-AI-Image-Workshop/README.md

# Hacklahoma AI Image Workshop (Fall 2023)

## Launching a Stable Diffusion Instance on Cloud

Go to [Google Cloud](https://console.cloud.google.com) (or any other cloud provider), and open console. 
[Trial $300 Google Cloud credits](https://cloud.google.com/free/docs/free-cloud-features). 

_Since Google Cloud seems to be very scarce on GPUs, I will use [RunPod](https://www.runpod.io/console/pods) in my demo today._

Create a VM with a GPU, at least 15GB RAM, and 30GB disk. Connect to SSH, install git, and clone this [Automatic1111 (A1111) repository](https://github.com/AUTOMATIC1111/stable-diffusion-webui).

First `cd` in to the cloned directory and edit `webui-user.sh` for remote access:
```
$ cd stable-diffusion-webui
$ nano webui-user.sh
```
Add the following to `COMMANDLINE_ARGS`
```
--device-id=0 --no-half-vae --xformers --share
```
and exit nano with `Ctrl+X` saving the changes.

Now we are ready to launch A1111 with:
```
$ bash ./webui-user.sh
```
You should see a link like `https://xxxxxxxxxxxxxxxx.gradio.live` after the webui finishes launching. Warning, do NOT share the public link, others can abuse you instance and increase your bill.

## Model Downloads

Popular Base Models
```
SDXL1.0: https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/resolve/main/sd_xl_base_1.0.safetensors
SD1.5: https://huggingface.co/runwayml/stable-diffusion-v1-5/blob/main/v1-5-pruned-emaonly.safetensors
DreamShaper8: https://civitai.com/api/download/models/128713
```

LoRA Models: [CivitAI](https://civitai.com/models/)

Place downloaded models in the `stable-diffusion-webui/models`.

## Start Generating

### Image Dimensions (Resolution)

As different models are trained on different image resolutions, it is best to use the training image resolution for generations. For `SD1.5` use 512x512 and for `SDXL1.0` use 1024x1024. You can slightly vary one of the dimensions without significant issues.

### text2img generation

`text2img` can be thought of as generating visual content based on textual descriptions. Popular models include [DALL-E](https://openai.com/dall-e-2), [Midjourney](https://www.midjourney.com/home), and [Stable Diffusion](https://stability.ai/blog/stable-diffusion-public-release).

![](images/sd-latent-space.jpg)

### img2img generation

`img2img` refers to the transformation of one image into another, typically maintaining the same content but changing the style or other visual attributes.

### ControlNet (+Stable Diffusion)

![](images/cn-sd.png)


Install extension for A1111: [`sd-webui-controlnet`](https://github.com/Mikubill/sd-webui-controlnet)
Update 'README.md' 1 year ago			`# Hacklahoma AI Image Workshop (Fall 2023)`
Initial commit 1 year ago
Update 'README.md' 1 year ago			`## Launching a Stable Diffusion Instance on Cloud`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`Go to [Google Cloud](https://console.cloud.google.com) (or any other cloud provider), and open console.`
Update 'README.md' 1 year ago			`[Trial $300 Google Cloud credits](https://cloud.google.com/free/docs/free-cloud-features).`

Update 'README.md' 1 year ago			`_Since Google Cloud seems to be very scarce on GPUs, I will use [RunPod](https://www.runpod.io/console/pods) in my demo today._`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`Create a VM with a GPU, at least 15GB RAM, and 30GB disk. Connect to SSH, install git, and clone this [Automatic1111 (A1111) repository](https://github.com/AUTOMATIC1111/stable-diffusion-webui).`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			First `cd` in to the cloned directory and edit `webui-user.sh` for remote access:
Update 'README.md' 1 year ago			```
Update 'README.md' 1 year ago			`$ cd stable-diffusion-webui`
Update 'README.md' 1 year ago			`$ nano webui-user.sh`
			```
			Add the following to `COMMANDLINE_ARGS`
			```
			`--device-id=0 --no-half-vae --xformers --share`
			```
			and exit nano with `Ctrl+X` saving the changes.
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`Now we are ready to launch A1111 with:`
Update 'README.md' 1 year ago			```
Update 'README.md' 1 year ago			`$ bash ./webui-user.sh`
Update 'README.md' 1 year ago			```
			You should see a link like `https://xxxxxxxxxxxxxxxx.gradio.live` after the webui finishes launching. Warning, do NOT share the public link, others can abuse you instance and increase your bill.

Update 'README.md' 1 year ago			`## Model Downloads`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`Popular Base Models`
Update 'README.md' 1 year ago			```
			`SDXL1.0: https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/resolve/main/sd_xl_base_1.0.safetensors`
			`SD1.5: https://huggingface.co/runwayml/stable-diffusion-v1-5/blob/main/v1-5-pruned-emaonly.safetensors`
			`DreamShaper8: https://civitai.com/api/download/models/128713`
			```

			`LoRA Models: [CivitAI](https://civitai.com/models/)`

Update 'README.md' 1 year ago			Place downloaded models in the `stable-diffusion-webui/models`.

			`## Start Generating`

			`### Image Dimensions (Resolution)`

			As different models are trained on different image resolutions, it is best to use the training image resolution for generations. For `SD1.5` use 512x512 and for `SDXL1.0` use 1024x1024. You can slightly vary one of the dimensions without significant issues.

Update 'README.md' 1 year ago			`### text2img generation`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`text2img` can be thought of as generating visual content based on textual descriptions. Popular models include [DALL-E](https://openai.com/dall-e-2), [Midjourney](https://www.midjourney.com/home), and [Stable Diffusion](https://stability.ai/blog/stable-diffusion-public-release).
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`![](images/sd-latent-space.jpg)`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`### img2img generation`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`img2img` refers to the transformation of one image into another, typically maintaining the same content but changing the style or other visual attributes.
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`### ControlNet (+Stable Diffusion)`
Update 'README.md' 1 year ago
Update 'README.md' 1 year ago			`![](images/cn-sd.png)`
Update 'README.md' 1 year ago

Update 'README.md' 1 year ago			Install extension for A1111: [`sd-webui-controlnet`](https://github.com/Mikubill/sd-webui-controlnet)