this post was submitted on 30 Dec 2024
45 points (85.7% liked)

Stable Diffusion

4389 readers
6 users here now

Discuss matters related to our favourite AI Art generation technology

Also see

Other communities

founded 2 years ago
MODERATORS
 

Abstract

We present 1.58-bit FLUX, the first successful approach to quantizing the state-of-the-art text-to-image generation model, FLUX.1-dev, using 1.58-bit weights (i.e., values in {-1, 0, +1}) while maintaining comparable performance for generating 1024 x 1024 images. Notably, our quantization method operates without access to image data, relying solely on self-supervision from the FLUX.1-dev model. Additionally, we develop a custom kernel optimized for 1.58-bit operations, achieving a 7.7x reduction in model storage, a 5.1x reduction in inference memory, and improved inference latency. Extensive evaluations on the GenEval and T2I Compbench benchmarks demonstrate the effectiveness of 1.58-bit FLUX in maintaining generation quality while significantly enhancing computational efficiency.

Paper: https://arxiv.org/abs/2412.18653

Code: https://github.com/Chenglin-Yang/1.58bit.flux (coming soon)

you are viewing a single comment's thread
view the rest of the comments
[–] kwilson@lemmy.world 3 points 1 month ago (4 children)

I'm not an expert on AI, but I'm surprised the comparison photos are so similiar. I was expecting the models to come with completely different images each time. The sky in the dragon pics specially looks like copied and pasted.

[–] Even_Adder@lemmy.dbzer0.com 2 points 1 month ago (3 children)

The hope is that they're similar. The pictures on the right are from a smaller version of the model that the pictures on the left are from. This shows that even though they shrank the model, it still understands the same prompt.

[–] kwilson@lemmy.world 1 points 1 month ago

yeah I get that, I'm just surprised that both times the image is so similar. both times the dragon looks right, both time the sky looks mostly the same, stuff that isn't part of the prompt you know.

The same prompt can sometimes give you completely different pictures, that still comply with the prompt, on the same model

load more comments (2 replies)
load more comments (2 replies)