this post was submitted on 29 Jun 2023
7 points (100.0% liked)
StableDiffusion
3 readers
1 users here now
For discussions around Stable Diffusion, a text-to-image generative AI model. Share your generated pictures, discuss the various UI and extensions, share news about releases, bring tutorials and more!
founded 1 year ago
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Awesome, I am still finding my way and am happy if I simply don't crash. I don't have a frame of reference to compare to a Nvidia card for this, but it does seem like we have a little more work in getting things smooth with the AMD cards. I can't say that my speed is terrible. Most renders finish in reasonable time. I am simply amazed that we can do this on consumer grade hardware.
Indeed. I'm in complete awe by this technology. It's an amazing pass-time that tickles the creative side. As for getting an idea of how different cards and system compare, you can check out https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
I also have an 6800 XT, and the performance on that particular benchmark is around 9 it/s. Something like this looks to be a rough indication.
AMD Cards:
NVIDIA Cards:
Included my current numbers, any optimisation advice would be much appreciated 😉
I think those numbers are roughly the same I get. It varies a bit from time to time. I also wouldn't know how to improve it, to be honest.
I managed to find an extra iteration or two without sacrificing to much stability.
6.06 / 7.59 / 9.11
Running with Doggettx selected as the optimiser in the optimiser config inside automatic1111.
I installed the google perftools as suggested in this thread
"sudo apt install libgoogle-perftools-dev"
And then added the following memory management options as suggested in this thread
by exporting: "export PYTORCH_CUDA_ALLOC_CONF=garbage_collection_threshold:0.6,max_split_size_mb:128"
Made the following changes to my web-user.sh
Uncommented the command line options as follows:
export COMMANDLINE_ARGS="--medvram --upcast-sampling"
And added the following lines to the end of the file
export LD_PRELOAD=libtcmalloc.so
export PYTORCH_CUDA_ALLOC_CONF=garbage_collection_threshold:0.6,max_split_size_mb:128
Those are some great notes, thanks for sharing.
No problem, backup and be careful. I think AMD still has a lot that can be done in the rocm drivers themselves. There should be gains left to make. Nvidia is not helping with their pricing either. Which should see more users on AMD. Hopefully better support for us. I am pleasantly surprised by what the card can do. I got it for gaming at 1440p, where it was the best bang for buck. The AI stuff is a cool bonus.
I run the tests out of interest. I am leaving some performance on the table due to my launch options, but I need those to avoid to many out of memory and other errors.
Normal Test:
Extensive Test