# Hacklahoma AI Image Workshop (Fall 2023) ## Launching a Stable Diffusion Instance on Cloud Go to [Google Cloud](https://console.cloud.google.com) (or any other cloud provider), and open console. [Trial $300 Google Cloud credits](https://cloud.google.com/free/docs/free-cloud-features). _Since Google Cloud seems to be very busy, I will use [RunPod](https://www.runpod.io/console/pods) in my demo today._ Create a VM with a GPU, at least 15GB RAM, and 30GB disk. Connect to SSH, install git, and clone this [Automatic1111 (A1111) repository](https://github.com/AUTOMATIC1111/stable-diffusion-webui). `cd` in to the cloned directory and run: ``` $ nano webui-user.sh ``` Add the following to `COMMANDLINE_ARGS` ``` --device-id=0 --no-half-vae --xformers --share ``` and exit nano with `Ctrl+X` saving the changes. Now we are ready to launch Stable Diffusion with: ``` bash ./webui-user.sh ``` You should see a link like `https://xxxxxxxxxxxxxxxx.gradio.live` after the webui finishes launching. Warning, do NOT share the public link, others can abuse you instance and increase your bill. ## text2img Background: `text2img` can be thought of as generating visual content based on textual descriptions. Popular models include [DALL-E](https://openai.com/dall-e-2), [Midjourney](https://www.midjourney.com/home), and [Stable Diffusion](https://stability.ai/blog/stable-diffusion-public-release). ![](images/sd-latent-space.jpg) ## ControlNet (+ Stable Diffusion) ![](images/cn-sd.png) Install extension for A1111: [`sd-webui-controlnet`](https://github.com/Mikubill/sd-webui-controlnet)