[ home / bans / all ] [ qa / jp ] [ aut ] [ f / ec ] [ b / poll ] [ tv / bann ] [ toggle-new / tab ]

/qa/ - Questions and Answers

Questions and Answers about QA

New Reply

Whitelist Token
Password (For file deletion.)
Markup tags exist for bold, itallics, header, spoiler etc. as listed in " [options] > View Formatting "

[Return] [Bottom] [Catalog]

File:Serial Experiments Lain 04….jpg (227.66 KB,1512x1072)


As time goes on and we consider different avenues of potential growth or new features for kissu, the big question of "What's the best way to do this?" comes into play a fair bit. Recently I've been considering some ideas that make me think the best way to really expand on what kissu has right now is for us to set up a mini-server/pc to allow us to have these features on-site without the need to pay out the butt for expensive and really not cost-effective rental servers. With that said, I'm not necessarily the most technologically inclined of the admins so most of my ideas are wrapped up in idealism and fantasy. As such I think it'd be a good idea to bring the discussion to kissu itself to get more practical ideas and maybe expand the range of what'd make for 'cool features' to add to kissu.

At the moment, the two main ideas that I have for potential additions relate to AI and games. For games, I want something that can serve as a perpetual server for whatever it is kissu may be playing at the moment (if anything). Then for AI my idea is to host a SD webpage on a kissu subdomain, and allow people to generate images, set weights, and do whatever else you can do on the offline SD on the page. The difference between it and other variations of that idea would be that we'd allow (trusted) users to add their own Loras/Hypernetworks and whatnot so that it takes on the role of an open development project rather than being restricted and closed off from improvements.

With those two goals stated, what does kissu think the ideal setup for this would be? Preferably the budget should be in the 3-8k range (preferably on the cheap end) as we don't want to invest a fortune into something that may not even pan out to be something interesting and I don't have mega-millions.


Hmm, I honestly have no idea. All I'd want to say is that SD may need some sort of queue mechanism to stop it from crashing when two people use it at the same time, and you might want to limit certain parameters even prior to that.


A game server shouldn't cost more than an old computer to run.
For SD it depends on how fast you want the image to be generated. And sometimes you can get more computing power per buck by using multiple GPUs which has benefit with a queue system.


File:41912df800edfc951a1407b65c….jpg (140.04 KB,609x565)

A queue sounds nice, but I'm not really sure how you'd set one up...

Also an issue we've considered is who'd take care of the server. I probably have the best space to put it, but I don't know anything about managing a server and would probably prefer vermin be able to fix whatever issues may arise.


Stable Diffusion already works off of a queue, so that shouldn't be an issue.

>I don't know anything about managing a server and would probably prefer vermin be able to fix whatever issues may arise.
Depending on the level of complexity we're talking about, some type of out-of-band management would be ideal for this. Server motherboards and some workstations will have "IPMI" which allows managing a computer and viewing its status across a network. It also allows advanced features such as being able to turn on a computer and even access the BIOS without being physically in the same location. The more "enthusiast", but DIY solution would be a PiKVM: essentially the same thing as an IPMI motherboard, but based off of a raspberry pi that you then plug into the motherboard headers (for power on, power off and reset), a USB port, and a HDMI port.

Now, depending on how complex the setup we're talking about is, and how much performance (and cost) we would want, there are a few different routes to consider:
1. Server-level hardware - reaching into the Enterprise realm, likely making use of a lot of old server equipment.
2. Prosumer - High end consumer hardware.
3. Minimum viable product - few generations out of date to cost-optimize.

So... We should ask ourselves what our requirements are.
1. For game servers, are we just hosting a single server at a time and shutting it down once activity falls off, or are we keeping them online indefinitely?
2. For stable diffuion, what sort of performance do we want? Is 4090 performance necessary (2 seconds to generate a 512x512), or is 3060-level performance okay (~30 seconds)?

If the answer to the first question is, "just one server at a time," then we could go for a more budget-oriented build. Otherwise, we would likely want a CPU that has a high core count to support multiple servers at once without bottle-necking. The second question kind of answers itself.

The third and final question is related to storage:
1. How much data do we actually need?
2. How fast do we realistically need our storage to be?
3. How valuable is our data? Do we care if the server's hard drive dies?

With a bunch of models, I'm currently using around 100GB for Stable Diffusion. A moderately sized Minecraft would can grow to a few gigabytes.
For storage speed, remember that we are limited by the network connection. A 200MB/s 3.5" HDD is generally going to be just as fast a 5000MB/s PCIe 4.0 x4 NVME SSD if our network connection is 200MB/s (1 Gigabit per second is roughly 125MB/s).
If our data is very valuable and we cannot tolerate a drive dying, we likely want to consider some form of RAID. Depending on how much storage we need, and how much redundancy we want will determine what RAID level we should go for. Bear in mind, RAID level will also improve read speed on our drive. Otherwise, if we don't really care about drive deaths, could could use any old drive.

1. What sort of operating system would we plan on using?
2. Would this server stay as-is once it's built, or would we want future expand-ability?

For a more complex setup, it may be beneficial to run a hypervisor such as Proxmox and then run virtual machines for whatever we need to do. For a more basic solution, something like Windows 10 or 11 would likely work fine, however.
If we would want future expand-ability, we should consider what that might mean: If NAS functionality is desirable, then a case that can fit a lot of hard drives like a Fractal Define R5 (8 3.5" HDD capacity) might work well, but it likely wouldn't fit a 4090. Something like a Fractal Meshify 2 XL or Define 7 XL might be worth considering: (16 3.5" HDDs and very spacious).


File:c611b1a5154e04ebb49a99fd9b….jpg (475.19 KB,1000x950)

I think for perfomance/cost I want to be somewhere around prosumer and enterprise. Although, I'm not really sure how much better enterprise would be once we have the next generation of GPUs out which are set to outclass most everything on the market, so I'd probably stick with Prosumer for now to reduce how much we need to spend on stuff that may just get upgraded.

In terms of game servers I think we'll just do hosting one game at a time, not like we're going to be running 2 /qa/ games at once. For SD, I think we want a fairly fast setup to reduce headaches on the user-end (assuming multiple people will be using it at similar times) so I'd probably go with the 4090 option.

For storage I would probably want a fairly large HDD to hold whatever necessary files we may need, and then maybe a nice SSD for the main system and AI generation (Although maybe there's a way to store resulting images in the HDD without affecting the speed of generation...). The RAID consideration is probably a good idea, but I've never set up one myself so I don't really know what it does besides serve as a backup.

>A 200MB/s 3.5" HDD is generally going to be just as fast a 5000MB/s PCIe 4.0 x4 NVME SSD if our network connection is 200MB/s
Maybe that's true for transferring over the net but when it comes to actually writing stuff onto the SSD is that really the case? I've heard somewhere that a fast SSD can actually improve speeds somehow but this may just be gibberish I've heard online.

>1. What sort of operating system would we plan on using?
>2. Would this server stay as-is once it's built, or would we want future expand-ability?
Now this is the part where I completely give up any thoughts and opinions of my own and transfer all my decision-making onto vermin. I really have no idea here.


File:Screenshot 2023-04-02 2231….png (35.93 KB,482x355)

>Although maybe there's a way to store resulting images in the HDD without affecting the speed of generation...
In the case of stable diffusion, bear in mind that the model is loaded into VRAM and RAM so HDD and SSD speed doesn't matter. I run my SD bot off of a WD Gold HDD. The only practical difference it would make is in load times for changing models. I don't think an end user is going to notice the time difference it takes for saving a ~300KiB to ~2MiB image.

>The RAID consideration is probably a good idea, but I've never set up one myself so I don't really know what it does besides serve as a backup.
The general idea is that striped arrays increase read speeds by however many non-parity drives you have. RAID 0 of 2 disks is a 2x increase. RAID 5 is an increase of n-1. RAID 6 is an increase of n-2. To make something both fast and cheap, I think maybe a RAID 10 of some SATA SSDs would be okay; 4 drives total, 2 sets of RAID 0 mirrored. So, a fault tolerance of 1 disk per pair, up to 2 disks in a single pair, but IIRC if 1 disk in each pair fails at the same time then all of the data is lost. Synology has a rough calculator that gives an idea of space and fault tolerance: https://www.synology.com/en-us/support/RAID_calculator?hdds=2%20TB|2%20TB|2%20TB|2%20TB

>Maybe that's true for transferring over the net but when it comes to actually writing stuff onto the SSD is that really the case?
For random reads and writes (i.e lots of tiny files), an SSD is going to be considerably faster than an HDD, whereas sequential reads and writes (few big files) will be perfectly fine. The worst case scenario in terms of reads would be reading from a parked hard drive that needs to spin up, but eh. Shouldn't really be a big deal either way.

If reads are a genuine concern, you could use a program like Primocache. For reference, this is the read speed of my HDD while using Primocache with a RAM cache. Nothing is going to be faster than RAM, but RAM is expensive and you likely wouldn't want to dedicate a large amount of RAM to caching. In that case you could add an M.2 drive (preferably an Intel Optane drive, but anything would work) for read caching so that reads would be accelerated to the speed of the SSD. You could do write caching too, but this is a bit risky since data is liable to become corrupted in case of a power fault or if the system is shut down unsafely; a UPS would be an absolute requirement.

Over all would be my tentative recommendation:

The only thing you might want to change is drives to something else. I chose those Intel data center SSDs because they've got a crazy high write endurance of 36.5 Petabytes, but something more pedestrian like WD blue SSDs or even HDDs would be fine. Stock fans should okay since the 4090 has a huge heatsink and the 13500 comes with a heatsink that should be good enough.


LGA1700 would also have options for future upgrade depending on the life of the server: potentially, you could upgrade to a 13700K or even a 13900K, but realistically it wouldn't help all that much since nobody's going to be gaming on this PC. I've heard that Intel is going to release a Raptor lake refresh so that would mean there'd be Intel 14th gen CPUs to upgrade to as well.

This would be my recommendation if someone wins the lottery though :P


File:[Commie] Teekyuu - 27 [BD ….jpg (118.28 KB,820x720)

>Total: $3068.06
When you take into consideration the GPU is $1600 that's pretty crazy cheap for the rest of the parts that I'd consider pretty nice. I think I'd probably want to swap out an SSD for a nice WD black HDD, yeah, but aside from that I can't really figure out any other issues I'd have with this setup. Best part is that 3k price tag is probably cheap enough for vermin to not feel insane pressure taking care of it so that problem sorts itself out nicely. Can probably get started on that as soon as I have a good day...

Brb buying a powerball ticket


what is this monstrosity whyyy howwww


>I think I'd probably want to swap out an SSD for a nice WD black HDD
If endurance and drive lifetime is a concern, I would recommend WD Gold HDDs instead. From what I remember, the WD Blacks have an MTBF of 1 million hours versus 2.5 million hours on the WD Golds. The Golds are also a bit faster. The only downside is that they can be a bit noisy since they're enterprise grade and not optimized for quiet operation, like you would want for a hard drive at someone's desk. Granted, the Corsair 4000D only has room for 2 3.5" drives so a different case would need to be chosen if you wanted to replace all of the Intel data center SSDs with WD Golds or some other hard drives.

>Brb buying a powerball ticket
I hope you win!

64 core Threadripper for supporting a high number of virtual machines.
2TB of RAM, largely for ZFS cache, but would also help with high performance virtual machines.
15 14TB WD Gold 3.5" HDDs in a RAIDz3 of 11 drives, with 4 drives as hotspares. Usable capacity of 112TB. Used for NAS and bulk storage.
7 Intel Optane P1600X's, mostly for ZFS special vdevs: log, cache, and metadata. Metadata vdev in a RAID10 for safety (if the Metadata vdev dies, all data is lost because it tells where data is stored on each drive, so redundancy is extremely important).
2 Intel Optane P5800X's Maybe for storing virtual machines. I mostly just wanted to utilize the U.2 connectors on the motherboard and P5800X's are the top of the line of Intel's Optane line before they scrapped it.
4x Nvidia RTX A4000's for giving virtual machines dedicated GPUs. Also for giving a Plex instance a GPU for GPU accelerated transcoding. Chose A4000's over A6000's or RTX 6000 Ada's due to power draw concerns. Likewise, the A4000 is a single slot card meaning more can be used, unless the other cards were using water blocks, but that would require a water cooling loop and the HDD's take up most of the space where that would go.
Fans, case, PSU, and UPS are self-explanatory.
Drive brackets because the case only comes with a few.
Mellanox SFP+ 10Gb NIC for high speed networking. In theory, could do 40Gb/s or 100Gb/s with a different card for direct attach storage to another device, but I don't think it's possible to buy 40Gb or 100Gb internet, let alone 10 Gb.
LSI 9400-16i used as an HBA for connecting the 15 hard drives.
Rolly cart because it would be too heavy to carry. Napkin math, it would probably weigh around 80 lbs, I think.
Displayport EDID emulator because sometimes machines get angry if they don't have a "monitor" connected. This emulates a monitor.
Then, finally, an SFP+ RJ45 transceiver because the Mellanox NIC is SFP+ not RJ45.

Over all, the system would be incredibly versatile, capable of running many virtual machines with their own dedicated cores and even GPUs to the point where you could run Windows virtual machines that would be on par with a midrange RTX 3070 system with 64GB of RAM. If I had the money for it, I would love to use a system like this to host every thing Kissu: from multiple concurrent game servers to stable diffusion to potentially even making virtual machines accessible for people to work on their own projects because their own hardware isn't powerful enough, or to just let people mess around with. Most server tasks are fairly lightweight so a system like this could easily last a decade of continuous usage. It's just a pipe dream, of course, but it would be a nice one to come true.


How much space would this monster take up? The server room next to me is like 6x5m.


The Meshify 2 XL is a standard large PC case. 600mm x 240mm x 566mm (23.6" x 9.5" x 22.3"). The other case that people often use for Threadripper builds is the Fractal Design Define 7 XL, but that case has a solid front panel so would likely have worse thermals compared to the Meshify 2 XL.


Oh, small.....


Price is not abusrd
Way too many drives
Why not a dedicated network card?

Likewise, I would rather strip off the 1000$ of SDD and buy myself a home security system and cameras for my room that notify me on a motion detector


Damn, 2TB of RAM. Does that mean text language models could have far longer "memories"?


Almost certainly, yes.


Tard here, doesn't text generation rely on VRAM only so any memory would also need to be in VRAM or is there some way to separate the two?


>Way too many drives
Idea was for a RAID10 of SSDs, and RAID10 has a minimum requirement of 4 drives. As mentioned, if the data isn't that important, then you could easily scale down or change the drives to something else.

>Why not a dedicated network card?
2.5Gb should be good enough, and there likely wouldn't be any real world benefit to using an add-in card. Not sure about clearance with the 4090. You could get a 10GbE networking card potentially, but, again, you're limited by the speed of your internet so if you have a 10GbE NIC going to a 1Gb network connection, it's just a wasted expense.

Depends on the model we're talking about. The Facebook Llama models and their derivatives use RAM instead of VRAM. You are right, however, in regards to other models using VRAM primarily.


I can agree with those reasons. I think SSD is going for something more than the requirements.


a dedicated network card is excessive for a simple setup. However, i believe that there are network devices such as switches, modems, routers, cabels and other network devices that represent that theme and cost


I guess this would be my recommendation without SSDs:

Drops price by $200 to $2897.70. Had to switch cases to accommodate 4 HDDs, however.




But, again, if data integrity isn't that important you could shave off another $239.94 and use a single HDD alongside the boot SSD for a reduced cost of $2,657.76.



or you could just mount it on a piece of plywood and stick it to the wall


looks better


@team just wanted to generate alignment on this follow-up. We would like to know if it's possible to move forward at this time. Could somebody please take an action item to socialize this understanding before the next review?


File:firefox_TQvDWnm7uU.png (66.08 KB,714x480)

Soon enough it'll be a reality.


Can't wait to play PissuCraft.


File:1507422880810.jpg (112.8 KB,734x693)

If we ever go for an upgrade I'm putting in the requirement that the specs all work together right out of the box...


When you ask for price saving you must concede something...


File:1516943858641.jpg (36.48 KB,400x400)

I repent, I'll never try the cheap route again...


File:milky.webm (218.67 KB,1920x1080)

I got to the BIOS screen! Now everything else related to setting up ubuntu and stuff...




Just install Ubuntu with the UI.
You'll just waste too much time trying to do things though the CLI and the system resources are high enough that you can get by with one running


File:20230508_142535.jpg (528.63 KB,2154x3840)

What's the difference? Since I realized I have absolutely no idea how to use CLI, but vern does.


The difference is running a desktop vs not running one. Removing it is a performance booster.

I think adding one to a server is not the hardest thing.



this seems neat, can keep the performance boost of cli while letting my tard self lean on gui when i need to


I'm messing around with Stable Diffusion stuff models again, I might be able to make a better one than Kissu Megamix... maybe. I guess it would be an option at least. Although, I don't really know what I'm aiming for at the moment


File:xyz_grid-0466-1girl,_portr….png (18.63 MB,6912x2304)

Alright, the Kissu Megamix v2 is a go! (top is original, bottom is v2)
I wasn't planning on making another merge any time soon, but with this kissu server thing happening I thought I'd try to make an improvement. However, it's not a binary gain because it trades off some of the 2D flat coloring for more detail, and the faces might be a little uncanny sometimes since it's basically adding more real life influence, but in a subdued way. But, I think a lot of that can be altered with tags like "realistic" in the negative prompt.
There will certainly be times when people prefer v1, and I'm sure some will flatly prefer the old one in all situations. But, I wanted to see if I could create a version that's better than Kissu Megamix in some ways. After generating about 100 comparison charts I think it's good enough to be considered an upgrade.
This most recent merge was about 20 hours of tinkering. I say 20 hours, but like 80% of the time was me doing something else while it generates. But, there was no way for me to make it shorter.

My AI stuff is centered around fapping, so this image doesn't really do it justice. (I'll throw a version of this image on /megu/ with the 'penis' and 'nsfw' tags added)
Anyway, I'll probably do a little bit more tinkering tomorrow to see if there's anything I'm missing, but once the Kissu AI image generation stuff is set up people will have access to this super merge of mine that's hundreds of hours in the making and in my humble opinion the best model in the world for what it does, as these prompts here in the image are all very plain and don't have any LORAs or elaborate scene modifiers. The first set is literally just "1girl, portrait, face".


File:1666214240909488.jpg (51.1 KB,600x450)

So maybe some people already realized it by now, but the Kissu-PC has so far been a resounding success! It took a bit of setup but now that's out of the way and we have both a MC-server and Stable-Diffusion server up and running at the same time. Earlier in the day we ran some tests to see if there'd be any conflicts between the two, but even when running at max usage on the SD server there were no hiccups or anything reported on MC. This means that the two can run perfectly simultaneously!

Although one thing that I want people to keep in mind is that the SD server at the moment probably won't be available to most of kissu for a while until I (Vermin) can get a permission system set up for it and clean up a bit of the UI. Maybe sooner than the full release we'll allow the ability to send in prompts and generate an image with preset settings.


File:00074-2370885358.png (2.3 MB,1616x1136)

Yeah, I'm doing some testing on it as we speak and it's going pretty well. Man, 4090s are so fast and I can make images far larger than I could with my own local setup.
Both of my Kissu Megamixes are on it and we can add other models on request, but my experience with them is limited since I prefer my own. We're going to load this thing up with hundreds of LORAs, including the ones I've made myself including the fabled Kuon LORA.


File:xyz_grid-0015-3620237341.png (11.73 MB,4320x2304)

We installed extensions and did other set up today for the Kissu SD image generator. I'll have to create a write-up on how to use the various things, but they're all optional. Some comparison charts were made to select the 'default' sampler and I think I like DPM++ SDE the most at the moment, but people will be able to change it.
We need to devise a way for people to 'preview' the effects of various LORAs, as the methods inside SD itself aren't that great, especially with one of the buttons to open the "Additional Networks" tab being broken in the most recent SD versions. Maybe a folder somewhere that people can look at with preview images? Or a thread here? Labeled images? It's a bit of a conundrum. Personally for my local install I have [artist name][traits] in a folder, so it looks like "Aoki Ume Cute Colorful" so I can Ctrl+F "cute" and see the results limited to LORAs that are cute.


So how are you going about doing the trusted users thing? Last time I saw this done, albeit with no real safeguards, it didn't take long for trolls to ruin it for everyone and the guy running it took it down out of fear of getting in trouble for hosting a cunny machine.


File:[SubsPlease] Jijou wo Shir….jpg (173.67 KB,1920x1080)

It's possible to set it up so that if a prompt contains a barred word it will only generate blank images. LORAs can be pre-tested to make sure no one is trying to sneak anything in there. I'm not sure I like the idea of people uploading checkpoint models themselves since that's a huge can of worms, so that seems like something to restrict if it's not a public model.


please don't ban cunny


I suppose posting the LORA on an auditing thread should be enough to get it added by a staff member. It's not like it's gonna do something bad when you give it a specific undisclosed prompt, right? Should be pretty straightforward.


you just block it on the UI, no?


File:00450-2998944651.png (3.55 MB,2048x1536)

LORAs can be trained to have a trigger word that can be an obscure token only the maker knows, and the same trigger word is needed for generation to observe its effect.
There is no way to know what's hidden in them if you don't know the exact secret word.


i'm a cunny plowing machine


File:explorer_EJqKfR2S4L.png (15.26 KB,363x481)

I've spent the past 5 days obsessively combing over merges and generating tons and tons of comparison charts and making notes and doing a bunch of other stuff. The great news is that my ISP and its monopoly forced me to buy unlimited bandwidth at an extra $50 a month or else I would spend more by regularly bypassing my lord's data transfer allotment to my peasant residence, so I just download everything now.
I've created a lot of merges which I test with other merges and blahblahblah. Pic related is just the checkpoint folder.

Long story short, I should be able to make some good new kissu megamixes and I think I'll start branching them out into categories


post charts


Uhh... they're all extremely pornographic...


in /megu/


File:9574fab205ba55551ac3028f1….jpeg (Spoiler Image,878.51 KB,858x1200)

Here's a chart


File:explorer_udcgAHhxvu.png (123.57 KB,242x281)

It's a new record and it took 211 minutes to generate. Sadly I forgot to have "show label' thing checked.
I'll try cropping it so I'm able to actually upload it on catbox, but after cutting it in half photoshop still hasn't finished compressing it after 15 minutes. Yeah, I'll have to do some more cropping.
This chart won't be very informative since I do merges with the 'live merge' extension which unfortunately can't be used with charts.

Anyway, my current plan is probably to have 2-3 Kissu Merges with different specializations, and we can/will have other checkpoint models just downloaded online.
-NSFW Focus
-Maybe, MAYBE the "2.5D style" that's a hybrid between 2D and real life. They look like video game characters, like Dead or Alive


You could always try uploading to MEGA or something.



File:xyz_grid-0026-.png (39.76 MB,4320x6912)

This is a SFW example comparison image thingie that I can post here on /qa/. If you can't tell, the bottom three rows are set up so that it's an increasing amount of real life model imagery in the merge.
I've been doing lots of tests and although I made it as an ingredient for the 2.5D mix attempt, I think my REALMIX is really good at what it does. It combines 3 different RL checkpoints into specific ratios and it counters the weaknesses of each one in a pretty good way, at least in my view.
But, uhh.. I feel like it's leaving the scope of Kissu's shtick so I don't think we can include it, among other reasons. But, you can see how using the RL models influences the merges I make.

I think my current goal is to have the flat colors of a furry mix, the relatively decent hands of an RL model, and then to cover all of it with a nice layer of Japanese-style 2D. That is something I could finally crown as Kissumix 3 (or maybe 2 since I don't like 2 that much)
It's pretty hard to get them all working at once, though, especially if you want characters and booru tags to be recognized.


File:xyz_grid-0030-.png (11.17 MB,4320x2304)

For the hell of it I'll show you what the furry models look like with these prompts. It's weird to think that these have anything to offer, but they're actually really good at certain things, the most obvious of which is the nice flat coloring that lacks the weird noise of most models.
The top one is actually in kissu megamix (yiffy-e18)


Hmmm, the bottom one seems to be able to do 2D pretty dang well for some reason. Wonder why that is.


File:grid-0248.png (2.18 MB,1152x1536)

I found the model by luck when doing a search on 4chan because it's only updated on some discord, so I have no idea on what the model actually is apart from its name (easter). It really sucks that so much stuff is locked away behind random discords instead of shared freely on the internet, but that's a rant I've already done a few times on kissu.
Anyway, It's definitely merged with some booru data somewhere since it recognized Remi and she's definitely not popular enough on furry sites. The top row is "pure" and that's why it looks so abstract- it uses e621 tags because all of its data is from e621. I never actually learned how to use it since kissu megamix thankfully didn't require e621 tags despite having furry data in it.


Make sure to have backups. Lain site got filled with soyjaks today.


If its lainchan lol




should have had cloudflare


File:01901-688197011.png (1.28 MB,864x1152)

I've spent the past two days doing a whole lot of checkpoint merging and testing for Kissu's eventual AI thingie.
However, due to the very NSFW nature of the testing my blog progress is on >>>/megu/ as usual.


File:01916-736400559.png (1.31 MB,864x1152)

Alright, I'm OFFICIALLY declaring this the greatest merge I've ever made in 8 months of doing stable diffusion merges.
We need to get this up and running sometime this year


This looks insanely good. Seems like the model testing is going swimmingly from the sound of it?


File:01995-1468094864.png (1.18 MB,864x1152)

Yeah, I took a break for a bit to do other stuff since merging isn't terribly fun (90% of the time you're doing other stuff waiting for it to finish, but you need to check it every couple minutes). I feel like this is the pinnacle that I can reach at the moment in any reasonable timeframe. The return on investment for doing more is rapidly fading away again.
The unfortunate thing is I still haven't found many patterns when it comes to merging when it comes to extrapolating it for future merges. I can't say I learned anything from the past two days when it comes to SD, just what works with this specific combination of checkpoints.


File:00511-3274723526.png (819.87 KB,768x864)

I have to agree, pretty much everything I've generated has come out looking at least decent, can't wait to release this to the rest of kissu.


Looking at these and the ones on /megu/ I gotta say, it's beautiful, absolutely beautiful. Honestly way beyond what I expected.


File:01211-3102990878.png (1.12 MB,864x1152)

Kissumono Mix possibly, maybe, well... no, probably not done. The colors are a bit weird and the character recognition isn't there. Not much of a Kissu Kemono mix if people can't prompt furry Patchy.
Not sure what to do on this one, guess I could merge it with something else...


File:00935-3585090753.png (1.32 MB,864x1152)

Not everyone has /ec/ visible, but I made a thread for testing out prompts can people can submit to me with the primary model we have loaded: >>>/ec/13144
This is just a temporary thing for testing purposes


File:1690689091967.png (1011.88 KB,712x960)

Here's an explanation for a good effect I created:
.sdC (eldritch_abomination:1.5) extremely_long_hair thai_text eclipse cityscape long_skirt rain monster hair_ornament (squid:1.0) (pixel_art:1.1)

(eldritch_abomination:1.5) - Pretty rare tag, so I need to emphasize it to increase it's features.
extremely_long_hair / long_skirt / hair_ornament - I want to empasize something long coming off of the head or torso that looks feminine and emphasizes shape
thai_text - It's something random I put in, but I figure it draws the model closer to artwork created by Thai artists. I quite like their version of anime art style
eclipse / cityscape / rain - Just a style. Gives a good background effect
monster / squid - boosters to a more demonic appearance
pixel_art - a stylistic modifier, increasing it modifies the style


File:f9t1iq.png (731.93 KB,712x960)

Adjusting the pixel art slider gave me a more feminine look too


File:tg3yhz.png (757.64 KB,720x960)

>Pretty rare tag
These models are all built on top of Stable Diffusion 1.5 as a base so it's not just pulling from boorus so stuff can work that you might not think will work. I really don't know how it functions, but all of this stuff is possible because SD had its own "definitions" of things that the booru scrapes used as a guideline. The best example of this is food which is surprisingly accurate and realistic. Typing "woman" instead of "1girl" will push it closer to reality, but not by much since my newer merges don't have much RL data in them (which makes hands a lot less reliable)
"Thai text" is most likely a placebo, but I'd have to test it on my own local model since you need to be able to repeat the seed to test influences.
Hmm... could try to add some real life merges in to improve hands, but ehhh


I mean, the set of all images that are given eldritch tags is far less than others, so when you ask for this tag it's going to start shoving into it more humanoid characteristics.
So you have to override the humanoid preferences with monster ones by giving the weaker traits a higher priority


File:rdd7kz.png (665.25 KB,712x960)

The thai_text thing is kind of interesting. It makes the pixelation have less weight, but removing it also removes any of the human like appearances of the generations and puts it definitely into monster territory


File:[SubsPlease] Eiyuu Kyoushi….jpg (359.18 KB,1920x1080)

I've been looking into locally hosted text generation and it's a bit limited. Retarded interactions aren't the problem, that stuff is actually fun, but the way text models are loaded is essentially "all or nothing". It sits in VRAM all day every day unless you manually unload it. Stable Diffusion reserves about 3GB once you boot it up and will then increase as it actively generates, but these models would be 4-12GB at all times. The kissu server's 4090 does have 24gb of VRAM, but having half of it reserved for text that people will use once in a while seems really wasteful...
I guess I can look into the really small models and see if they're coherent enough for it to not be entirely gibberish.

[Return] [Top] [Catalog] [Post a Reply]
Delete Post [ ]

[ home / bans / all ] [ qa / jp ] [ aut ] [ f / ec ] [ b / poll ] [ tv / bann ] [ toggle-new / tab ]