As I've blogged about in various threads, I've been doing AI stable diffusion model merging and LORA creation and stuff for a while now. I like to think that I'm pretty good at it. Well, I think I'm almost fully satisfied with the general model, but it st

Name
Options
Comment
File	Or URL:
Whitelist Token

Video Stream Embedding
Advanced Options	Always Noko Always Sage
Video Timestamp
Captcha Type	Captchouli
Spoiler	Unset Spoiler Image NSFW Image
Password	(For file deletion.)
Markup tags exist for bold, itallics, header, spoiler etc. as listed in " [options] > View Formatting "

File:00916-2435206029.png (1.4 MB,864x1152)

Kissu AI Image Gen User Input Prompt Testing Anonymous 07/06/23 (Thu) 23:37:59 No.13144

As I've blogged about in various threads, I've been doing AI stable diffusion model merging and LORA creation and stuff for a while now. I like to think that I'm pretty good at it.
Well, I think I'm almost fully satisfied with the general model, but it still needs some testing. Alas, I have only one brain so I'm limited to the stuff that I can come up with. Since this is something other people will eventually be using, I'd like people to give me prompts to test out. Nudity and other stuff is fine, but I'll probably post the more hardcore stuff on /megu/.
NOTE: This is the general model that is a poor at recognizing character tags. I'll figure out a way around it, but for now this is more general. Although I guess we could test which characters are recognized, but it's not very many of them. Reimu and other super popular ones work. (Kuon works because I trained a LORA for her half a year ago)

Please give me prompts to test out, using various poses and tags. It's a hybrid model that uses booru and probably e621 (furry) tags, so if you're familiar with either of those it'd be good. It is a bit of a "jack of all trades, master of none" situation though apart from how beautiful it is. It can make use of standard real life Stable Diffusion tags, too, like "dinosaur on a motorcycle" or something, but I don't usually use those so I'm not how good they'd be.

As an example, mermaid Kuon here is:
1girl, seashell bikini, rainbow hair, gradient hair, (mermaid:1.2), gold armor, very long hair, jewelry, crown, arms out, fish tail, lying, floating, scales, marine, coral reef, (plant humanoid), petals, underwater, smile, :d

Useful booru tag info:
https://danbooru.donmai.us/wiki_pages/tag_group%3Aimage_composition
https://danbooru.donmai.us/wiki_pages/tag_group%3Ahands (don't count on these being good)
https://danbooru.donmai.us/wiki_pages/tag_group%3Aposture
https://danbooru.donmai.us/wiki_pages/tag_group%3Ahair_styles
https://danbooru.donmai.us/wiki_pages/tag_group%3Aface_tags

E621 has its own set of tags and I'm not very familiar with them, but here's their wiki:
https://e621.net/help/wiki

Anonymous 07/07/23 (Fri) 00:37:36 No.13145

File:00009-1475836378.png (1.3 MB,864x1152)

Hmm, but the Reimu recognition isn't 100% there it would seem

Anonymous 07/07/23 (Fri) 00:53:36 No.13146

Hmm, I don't have a particular scene in mind, but how does the Ume LORA look?

Anonymous 07/07/23 (Fri) 01:24:42 No.13147

File:grid-0005.png (4.93 MB,1728x2304)

>>13146
Alright then, with a simple "1girl, smile, upper body" prompt with the Ume LORA active.
Hmm, I think style LORAs might be something I need to address, so I'll have to keep that in mind. It looks a bit more grainy and noisy here than it should, but the LORA block weight thing can probably fix that up.

Anonymous 07/07/23 (Fri) 01:25:57 No.13148

File:grid-0006.png (4.5 MB,1728x2304)

from IRC:

masterpiece, award-winning, absurdres, 8k wallpaper, immaculate quality, superbly detailed, peerless rendering, stunningly beautiful, divinely inspired, big butt

Anonymous 07/07/23 (Fri) 01:37:48 No.13149

File:grid-0012.png (4.9 MB,1728x2304)

from IRC
loli, plump, flat chest, thick thighs, wide hips, belly, reverse bunnysuit, dark-skinned female, red eyes, white hair, huge ass

Anonymous 07/07/23 (Fri) 01:41:12 No.13150

File:grid-0015.png (4.71 MB,1728x2304)

With emphasis on loli and flat chest and removed 'huge ass'

Anonymous 07/07/23 (Fri) 01:43:59 No.13151

File:grid-0017.png (4.84 MB,1728x2304)

(loli:1.2), (chibi), plump, (flat chest:1.3), thick thighs, wide hips, belly, reverse bunnysuit, dark-skinned female, red eyes, white hair, (huge ass:0.8)

Anonymous 07/07/23 (Fri) 02:10:14 No.13152

File:grid-0020.png (4.76 MB,1728x2304)

with (dark skin:1.3)

Anonymous 07/07/23 (Fri) 04:14:53 No.13153

loli, chair, pink, no legs, white gloves, eyes, frown, sad, summer, 1girl

Anonymous 07/07/23 (Fri) 04:25:39 No.13154

File:grid-0037.png (4.9 MB,1728x2304)

>>13153
Mmm, "no legs" really isn't going to work because it will read "legs" and put legs in there. Such is the problem with AI.
Also maybe I should start asking people how smooth or "painted" they want stuff to look, from a scale of 1 to 3. Anyway, here is 2.

Anonymous 07/07/23 (Fri) 04:27:26 No.13155

File:grid-0038.png (4.61 MB,1728x2304)

>>13154
I think you were probably after some sad amputation thing, but I doubt that's possible. If you want just an upper body look, then "upper body" works.
Here is the "smoothest" look with the least amount of paint strokes, but really it will need more booru tags to begin to look flatter, color-wise.

Anonymous 07/07/23 (Fri) 06:49:16 No.13156

>>13154
>>13155
i was thinking something more like an amputee but thought maybe it would do an upper body instead
still the result is pretty darn close to what i had in my head, only thing missing from my imagination of it, is a gently breeze with some petals but i didn't specify that so not surprised there

Anonymous 07/09/23 (Sun) 06:29:58 No.13159

1girl, loli, pink hair, dark hair, very long hair, camisole, bare shoulders, bare feet, pout

Anonymous 07/09/23 (Sun) 06:59:10 No.13160

File:grid-0189.png (5.03 MB,1728x2304)

>>13159
Bleh, I was messing with checkpoints and made the server crash or someting. Well, running locally...
I'm going to assume anyone telling me to prompt loli will want me to add 'chibi' and 'flat chest' so it produces a loli body, so I added it.
Pout is something that never works. There's a LORA for it, but I haven't messed with it on the new model yet and I don't want to tinker with it as I'm about to fall asleep.

Anonymous 07/09/23 (Sun) 07:02:21 No.13161

>>13160
Good images and good to know.

Anonymous 07/09/23 (Sun) 07:02:27 No.13162

>>13160
hmm well, I guess on closer inspection the bodies aren't very loli-like. Might need better tagging for this, or to use artist LORAs...

Anonymous 07/09/23 (Sun) 07:29:13 No.13163

fingers, toes, hair, joints, patterns

Anonymous 07/09/23 (Sun) 07:34:31 No.13164

File:grid-0205.png (4.87 MB,1728x2304)

>>13163
Well....
(without any blatant booru tags it tends to look more western)

Anonymous 07/09/23 (Sun) 19:47:14 No.13166

File:xyz_grid-0010-1170495876.png (17.72 MB,6912x3058)

An example of steering the art direction by choice of prompt.
The left is danbooru,
second one is the "natural language" of stable diffusion
third and fourth are e621 tagging with slightly different tags

Basically the more booru tags you use the closer to the expected 2D booru image you'll get and likewise for the furry direction

Anonymous 07/09/23 (Sun) 21:20:30 No.13167

big penis, wrinkly penis, (((dick vein))), smelly ballsack, sounding

Anonymous 07/09/23 (Sun) 21:34:00 No.13168

>>13167
I was wondering how long it would be until someone did something disgusting...
Yeah, I think I'll post that one on /secret/ in the "wonders of AI" thread with a spoiler

Anonymous 07/09/23 (Sun) 21:36:29 No.13169

poor forgotten /megu/

Anonymous 07/09/23 (Sun) 21:52:16 No.13170

File:00221-2554430168.png (1.2 MB,864x1152)

>>13169
The existing /megu/ thread is more of a nsfw blog of mine so I'd need to make a new thread for fulfilling lewd AI requests. I'd want a better OP image than that.
Well, I guess I could make it later but so far people are not nearly as interested in prompting pornographic stuff as I am so ehhhh

Anonymous 07/09/23 (Sun) 22:01:51 No.13171

File:Fmw9DYpaEAI4WHP.jpg (197.3 KB,967x1598)

can you do CLIP interrogation or whatever it was called when you feed in an image and get a likely prompt out?

Anonymous 07/09/23 (Sun) 22:11:13 No.13172

File:grid-0056.png (4.73 MB,1728x2304)

>>13171
If you're telling me to take that and turn it into a prompt, sure:

1girl, paw shoes, paw gloves, solo, gloves, animal hands, animal ears, tail, on back, cat ears, lying, ahoge, loli, navel, blush, brown eyes, cat tail, twintails, open mouth, bangs, shoes, long hair, bed sheet, hair between eyes, animal ear fluff, wavy mouth, light brown hair, @_@, flat chest, sweat, spread legs, swimsuit, bikini, panties, underwear, arms up, cameltoe, bare shoulders, fake animal ears

But you then need to weed out the stuff that is wrong or generic. I.E she has "paw shoes" so "shoes" is redundant and would take away from the generation, and it's not a swimsuit but "cat lingerie".

1girl, paw shoes, paw gloves, solo, on back, cat ears, lying, ahoge, loli, navel, blush, brown eyes, cat tail, twintails, open mouth, bangs, long hair, bed sheet, hair between eyes, animal ear fluff, wavy mouth, light brown hair, @_@, flat chest, sweat, spread legs, cat lingerie, arms up, cameltoe, bare shoulders, fake animal ears

Here's the result with the version of the model that is good at character recognition and booru tags

Anonymous 07/09/23 (Sun) 22:12:32 No.13173

File:grid-0057.png (4.79 MB,1728x2304)

>>13172
And here's the 'regular' version. Although, hmm, it did a decent job with the pose. Just need to put more work into getting a loli result with more weights and added tags like 'chibi'

Anonymous 07/09/23 (Sun) 22:15:48 No.13174

>>13172
>>13173
Interesting results. The set of tags is pretty accurate but the regenerated images do not capture the art style or the loli nature (look how big Mahiron's head, hands and feet are as a proportion of his body, that's missing). The @_@ tag doesn't have any discernable effect on the images either.

Anonymous 07/09/23 (Sun) 22:17:29 No.13175

If you wouldn't mind doing one more CLIP interrogation, what does it say about this cat photo? >>>/qa/102927

Anonymous 07/09/23 (Sun) 22:25:11 No.13176

File:C-1688941510151.png (2.54 MB,920x1520)

>>13172
Huh, those look relatively normal compared to what I get when I try to generate that image with that prompt list. I added in "Megumin" and used controlnet on the original image and each generation turns out a whole lot more painted than those.

Anonymous 07/09/23 (Sun) 22:28:08 No.13177

File:grid-0058.png (5.03 MB,2304x1728)

>>13174
Well, I just plugged in what was there. I'd have to add tags like chibi or maybe big head to try and get the desired results. You expect too much from it if you expect it to replicate an art style with a list of tags. And yeah, good luck getting eye stuff to work. I think there are some LORAs for them, though.

>>13175
The SD CLIP says:
a cat sitting in the grass next to a fence and bushes with green leaves on it and a fence behind it, Felix Octavius Carr Darley, regal, a photocopy, naturalism

Anonymous 07/09/23 (Sun) 22:30:27 No.13178

File:grid-0060.png (4.75 MB,2304x1728)

>>13175
>>13177
And here is the result with the booru-based tags:
no humans, cat, animal focus, plant, whiskers

>>13176
Hmm, upload it to catbox so I can see the prompt settings

Anonymous 07/09/23 (Sun) 22:38:42 No.13179

>>13178
https://files.catbox.moe/fsjbmw.png
Here, it's what I tried for my prompts

Anonymous 07/09/23 (Sun) 22:42:23 No.13180

File:C-1688942542686.png (2.24 MB,920x1520)

>>13176
Oh yeah, and this is with the Kissumono mix

Anonymous 07/09/23 (Sun) 22:44:52 No.13181

>>13177
>regal
OMG SO TRUE
my parents actually said once that in that pic our cat looked like a king

Anonymous 07/09/23 (Sun) 22:55:54 No.13182

File:xyz_grid-0016-1651180902.png (11.42 MB,3240x3597)

>>13179
There's a few things:

The sampler is different.
Euler or Euler a for smooth appearance
DPM++ 2S a for, uhh.. "medium"
Karras stuff for more painty noise

The CFG is higher (higher CFG will introduce noise for some reason)

Also I started with a slightly larger image and upscale it less. I start at 576x768 and do 1.5 as my default. I haven't done much testing with this because my GPU can't really do images that are much larger.

And also of course denoise being lower will, uhh.. well, reduce the amount of noise that is being denoised when upscaled

Anonymous 07/16/23 (Sun) 18:42:23 No.13193

>>13172
could you use >>10596 as a basis to create a pout prompt and then mass produce pouts?

Anonymous 07/16/23 (Sun) 19:08:00 No.13194

>>13193
I don't know how good it would look, but yeah. Extremes like a super pout are more difficult to do than basic stuff, but I don't have much experience with the ControlNet stuff since I find it to be too much work and it eats up so much VRAM that I'd rather generate 20 random images than 2 "traced" ones

/ec/ - エッチ/Cute

New Reply