Creating Your Own AI Image Generator: A Guide with Invoke AI and Stable Diffusion

If you’re looking to unleash the full potential of AI image generation without the constraints of cloud-based services, Invoke AI might be your perfect ally. This versatile tool allows you to run a variety of image generation models directly on your hardware, eliminating the need for costly subscriptions or dealing with pesky watermarks. Imagine the creative freedom you can achieve with this powerhouse, especially when using a recent GPU.

Invoke AI is particularly appealing for those who wish to maintain ultimate control over their AI creations. Even if your hardware is modest, you can achieve decent results, making it accessible to many users. The magic begins by downloading the Invoke AI community edition. For Windows users, the installation process is now largely automated, making it easier than ever to get started. However, for Linux and macOS users, a bit more manual work might be required.

For this exploration, a virtual machine running Windows 11 served as the testbed, equipped with an RTX 4070 and 24GB RAM. If you’re an AMD GPU user, Linux is your playground. After installing Invoke AI, it’s advised to activate “Low-VRAM mode.” This involves a quick tweak to the invokeai.yaml file in your installation folder. Just add the line “enable_partial_loading: true” and you’re set.

To get started with image generation, some pre-trained models need to be downloaded. Models like Dreamshaper and CyberRealistic are readily accessible, but for the much-loved Stable Diffusion, you’ll need to grab a token by creating an account with Hugging Face. This process is straightforward, and once the token is integrated, you’re ready to download the model.

Keep in mind that some of these models require significant storage space. For instance, Stable Diffusion 3.9 can take up to 19 GB. But once your setup is complete, access to the Invoke AI interface is just a browser click away, using the URL http://127.0.0.1:9090.

The “canvas” tab is your creative hub where text prompts can lead to stunning image generations. The model options are diverse, ranging from Juggernaut XL to Stable Diffusion 3.5, each offering unique visual styles. Stable Diffusion, although the slowest, frequently produces the most realistic results.

Invoke AI isn’t just about generating images; it empowers you to refine and perfect them, offering tools to rework parts of an image and create intricate workflows. Whether you’re a hobbyist or need royalty-free images for a project, this software puts creativity and control in your hands. Sure, there’s a bigger conversation about AI’s ecological footprint, but for those eager to personalize and expand their digital art repertoire, running AI locally is an exciting and viable option.