Using Models
In this lesson, we will create a realistic portrait of a woman having a drink in Amsterdam
PREREQUISITES
You should have created a fat cat and are ready to start making realistic people, and have activated and logged in to your private creation area. Let’s begin.
Lesson Goals
- Learn how to browse models
- Choose a model and use it in the system
This lesson will teach you how to work with Models, and we’re not talking about supermodels like Naomi Campbell. In lesson 1, we used the model called SDXL. A model is simply a type of computer file used in AI applications. This (oversimplification), is enough to understand what is meant by “downloading a model”, or “choosing a model”. There is more information about models at the end of this tutorial.
Open your WebUI and let’s use a model:
Step 1: Type /concepts to see the list
You can also bookmark this page on the browser
We’re looking for a realistic “full” model, also called a base model. We can browse by tags or also search for what we want
/concept /search:realistic
A good one to start with is <photon>
Step 2: Copy this prompt
It’s good practice to describe who, what, how, and where, in this order. For example:
/render <photon> A portrait of a beautiful 21 year old Swedish woman drinking a beer next to the river in Amsterdam
This is will not get you a great result, but we’re not done yet. Prompting without negatives is a sure fire way to get weird hands and a not quite a realistic photo. Let’s fix that.
You’ll get a giant quality boost by learning how to use negative inversions, or just remembering some handy shortcuts like #boost
Step 3: Add a recipe or Adetailer for better results
Photon is a Stable Diffusion 1.5 model, so you can use an after detailer feature by adding /adetailer to your prompt
You can also use the recipes system, a collection of useful prompt templates. Type /recipes to see a list. A popular one is #boost
Putting it all together:
/render <photon> #boost /adetailer A portrait of a beautiful 21 year old Swedish woman drinking a beer next to the river in Amsterdam
You an also add the aspect ratio shortcut like /portrait
The results are night and day! You can then use Inpaint to continue improving the hands and other imperfections
Prompting tips:
- One base model per image. You can add many models to an image, but the last mentioned “full model”, also called a checkpoint or base model, is the one that will win.
- SDXL models are not compatible with other Stable Diffusion models. As they say in the movies, never mix the streams. At the moment, the most widely used version of Stable Diffusion is 1.5 due to it’s massive year-old ecosystem. We support both as well as Stable Diffusion 2.1, which can be triggered with <sdv21>. The software will warn you if you’ve mixed up the Stable Diffusion families, it simply won’t render until there is alignment.
Glossary
Models are just files, very often large files. Models, (also known as checkpoints, loras, inversions, or concepts) are a blanket term for “AI Models”, meaning there is a computer somewhere with a literal file (the model is a file) that can produces a certain affect.
In lesson 1, a model called Stable Diffusion XL (SDXL) had already ingested literally millions of photos of cats. That was the product of a very costly AI training exercise by the creators of Stable Diffusion.Models are the end-products of many hours of rigorous training that a computer or group of computers perform to understand a pattern.
A model is different from a database, because models store fuzzy patterns rather than specific data. This idea of “training” a model refers to ingesting many photos and looking for similar patterns. That’s why the AI knows what a cat looks like.
Checkpoint Models – Checkpoints are also known as “full models” or “base models”. They are often 2-10GB files. A checkpoint has the most impact on the outcome of an image, and there isn’t a way to reduce the “weight” or influence of a checkpoint. If you select <photon> you are getting 100% of what Photon can do. These contain the overall art style of a composition. SDXL is also an example of a checkpoint. When no checkpoint is specified, your WebUI may default to the most basic version, good old Stable Diffusion 1.5.
Unless you’re researching something retro, you probably don’t want this, so always remember to pick the right model for the job before prompting.
At the time of this update, we are hosting over 3000 models, which weighs in at a hefty 2 terabytes. For most people, this would be an impossibility to store locally on their smartphones or their computers, which is one of the primary reasons we created our service. For the AI enthusiast who wants all of this power on the go, we’re here for you.