Image preview #522
+1 for this in main, great work!
Before this is merged, should I rename the "proj" preview method to "latent2rgb", as it's called in ComfyUI?

I think the naming doesn't really matter. Once the potential license issue I mentioned in the review comments is resolved, this PR can be merged.

@leejet, your comments aren't showing up for me. But I guess you could be referring to where the projection matrices come from?

I'm not seeing them either, I was very confused.

But I guess to avoid any licensing issues I could just train the projection matrices myself. It kinda feels like reinventing a perfectly working wheel though.

It could be argued that the matrices are just the product of an algorithm (training, a simple least-squares approximation, etc.), and thus not restricted by copyright. The problem is the "arguing" part 😕 Even if that argument is sound (and I personally believe it is), sidestepping the issue through an independent implementation would completely avoid that kind of headache.

My original review comment:

As far as I know, algorithms themselves are not protected by copyright law; only the specific source code implementations are.

SD3's projection was taken directly from the official inference code (MIT). For the others, I'm pretty sure the data is distilled from the VAEs. I don't think it counts as "creative", but if we really want to be extra safe, we could re-train them. As far as I know, ComfyUI doesn't say where these weights come from.

OK, I updated all latent-to-RGB projections except for SD3.x. Only the SDXL projection feels like a small downgrade; everything else seems about on par with or better than the previous version. I trained the Wan 2.1 and 2.2 projections on still images only, but they seem to handle motion fine (not perfect, but good enough for now).
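
For readers following along, here is a rough sketch of what a latent-to-RGB projection of this kind amounts to: each latent pixel (one value per latent channel) is mapped to RGB by a small matrix plus a bias. The function name `latent_to_rgb`, the `[channels][3]` layout, and the argument types are assumptions made for illustration; they are not taken from this PR's code.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not the PR's implementation): project one latent
// pixel to RGB. `proj` holds one RGB row per latent channel, `bias` is
// added once per output channel.
std::array<float, 3> latent_to_rgb(const std::vector<float>& latent,
                                   const std::vector<std::array<float, 3>>& proj,
                                   const std::array<float, 3>& bias) {
    assert(latent.size() == proj.size());  // one projection row per channel
    std::array<float, 3> rgb = bias;
    for (std::size_t c = 0; c < latent.size(); ++c) {
        for (std::size_t k = 0; k < 3; ++k) {
            rgb[k] += latent[c] * proj[c][k];
        }
    }
    return rgb;
}
```

Applying this to every pixel of the latent gives a low-resolution preview without running the VAE decoder, which is why it is cheap enough to run at each sampling step.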
// change range
r = r * .5f + .5f;
g = g * .5f + .5f;
b = b * .5f + .5f;
I now see I could easily bake this into the proj matrices and bias (new_proj = proj*0.5, new_bias = bias*0.5+0.5), but I'm not sure it's worth putting time into this.
Chroma Radiance would still need this though, so probably not worth it.
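
To spell out the folding idea from this thread: since the range change is itself affine, `0.5 * (proj * z + bias) + 0.5` equals `(0.5 * proj) * z + (0.5 * bias + 0.5)`. A minimal sketch follows; the `Projection` struct and the function names are hypothetical and only serve to illustrate the arithmetic, they are not the PR's code.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container for a latent-to-RGB projection.
struct Projection {
    std::vector<std::array<float, 3>> proj;  // one RGB row per latent channel
    std::array<float, 3> bias;
};

// Apply the projection to a single latent pixel.
std::array<float, 3> project(const Projection& p, const std::vector<float>& latent) {
    assert(latent.size() == p.proj.size());
    std::array<float, 3> rgb = p.bias;
    for (std::size_t c = 0; c < latent.size(); ++c)
        for (std::size_t k = 0; k < 3; ++k)
            rgb[k] += latent[c] * p.proj[c][k];
    return rgb;
}

// The separate range change from the diff: [-1, 1] -> [0, 1].
std::array<float, 3> change_range(std::array<float, 3> rgb) {
    for (float& v : rgb) v = v * .5f + .5f;
    return rgb;
}

// Fold the range change into the weights:
// new_proj = proj * 0.5, new_bias = bias * 0.5 + 0.5.
Projection fold_range_change(const Projection& p) {
    Projection out = p;
    for (auto& row : out.proj)
        for (float& w : row) w *= .5f;
    for (float& b : out.bias) b = b * .5f + .5f;
    return out;
}
```

With this, `project(fold_range_change(p), z)` gives the same result as `change_range(project(p, z))`, up to floating-point rounding. A code path with no projection matrix to fold into would still need the explicit range change.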




Forked off #454
Would also probably replace #416
examples/cli/main.cpp to stable-diffusion.cpp
latent projection (as demonstrated in fast latent image preview #454), TAE, or VAE
preview.png

Related to #354: if the user uses an image viewer that updates its render when the image file changes, then it's possible to see the progress in real time.