add support for scaled fp8 tensors #915
base: master
Conversation
Actually, it does: quality degrades much more without it. Try e.g. any SDXL model at q5_0.
Interesting, good to know - I added back the dummy imatrix.
It seems that you have removed some code related to model conversion, such as f64 → f32. This can cause issues when loading certain models. I suggest that if you don't fully understand the reason behind some parts of the code, you shouldn't modify them. Instead, you should only change the parts that you do understand.
I think the convert_tensor function should handle that - there's a case added that checks for the GGML_TYPE_F64 source type and converts it to GGML_TYPE_F32 when necessary. If you have a model in mind that you think this change may break, I can test it to make sure it works properly. In my opinion it no longer makes sense to use hacky sd types now that ggml has added support for f64, bf16, etc., but if you have other reasons for not using the native ggml types, I'm all ears.
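A minimal sketch of the kind of fallback being discussed (the function names here are illustrative, not the project's actual convert_tensor code):

```c
#include <stddef.h>

#include "ggml.h"

// Illustrative only: route source types that ggml ops can't compute with
// to a supported target type. The real conversion logic in the project is
// more involved; this just shows the f64 -> f32 (and i64 -> f32) idea.
static enum ggml_type fallback_type(enum ggml_type src) {
    switch (src) {
        case GGML_TYPE_F64: return GGML_TYPE_F32; // ggml ops lack f64 kernels
        case GGML_TYPE_I64: return GGML_TYPE_F32; // see the commit note below
        default:            return src;
    }
}

// The f64 -> f32 case itself is just an element-wise narrowing cast.
static void convert_f64_to_f32(const double * src, float * dst, size_t n) {
    for (size_t i = 0; i < n; i++) {
        dst[i] = (float)src[i];
    }
}
```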
Most of ggml's ops do not support f64/i64/bf16, which will cause issues. You can use this model for testing: https://civitai.com/models/7371/rev-animated. This model contains f64, and your changes will cause problems with it.
Why? f16 is e5m10, so this should be lossless.
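For reference, a standalone sketch of why the e5m2 → f16 widening is lossless (the helper name is illustrative, not the project's code):

```c
#include <stdint.h>

// f8_e5m2 layout: S EEEEE MM          (1 sign, 5 exponent, 2 mantissa, bias 15)
// f16 layout:     S EEEEE MMMMMMMMMM  (1 sign, 5 exponent, 10 mantissa, bias 15)
// The sign and exponent fields match, so widening is a plain left shift: the
// two mantissa bits land in the top of f16's mantissa and the rest is
// zero-filled. Normals, subnormals, infinities and NaNs all map to the same
// values, so no information is lost.
static inline uint16_t f8_e5m2_to_f16_bits(uint8_t x) {
    return (uint16_t)x << 8;
}
```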
I think it's better practice to avoid including too many unrelated changes like that in one PR. It makes the PR harder to review; if some of the changes are bad, the whole PR can't be merged; and it has a higher chance of breaking other pending PRs.
scaled f8_e5m2 tensors are multiplied by a float32 scaling factor |
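A rough sketch of what that dequantization could look like, assuming a single per-tensor float32 scale and illustrative helper names (the actual code may store and apply the scale differently):

```c
#include <math.h>
#include <stddef.h>
#include <stdint.h>

// Decode one f8_e5m2 value to float (bias 15, handling subnormals/inf/NaN).
static float f8_e5m2_to_f32(uint8_t x) {
    const int sign = (x >> 7) & 0x1;
    const int exp  = (x >> 2) & 0x1f;
    const int man  =  x       & 0x3;
    float v;
    if (exp == 0x1f) {
        v = man ? NAN : INFINITY;               // exponent all ones: inf/NaN
    } else if (exp == 0) {
        v = ldexpf((float)man, -16);            // subnormal: (man/4) * 2^-14
    } else {
        v = ldexpf((float)(4 + man), exp - 17); // normal: (1 + man/4) * 2^(exp-15)
    }
    return sign ? -v : v;
}

// Dequantize a scaled f8_e5m2 tensor: decode each element, then apply the
// float32 scaling factor.
static void dequant_scaled_f8_e5m2(const uint8_t * src, float scale, float * dst, size_t n) {
    for (size_t i = 0; i < n; i++) {
        dst[i] = f8_e5m2_to_f32(src[i]) * scale;
    }
}
```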
Good to know, thanks! I took a closer look at the ggml library and you're right that the f64/i64 types are missing kernels. I think bf16 has full support for all the ops, though, since most GPUs have hardware support for this type (a quick sketch of the bf16/f32 mapping is included below). I tested the rev-animated model with these changes and it works both with quantization disabled and with quantization set to bf16.
I moved the wtype changes to a separate PR and split this one into two commits, as the changes are stacked.
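For context on the bf16 point above, a self-contained sketch of the bf16/f32 relationship (ggml ships its own bf16 type and conversion helpers; these standalone ones only illustrate why the type is cheap to support):

```c
#include <stdint.h>
#include <string.h>

// bf16 is the top 16 bits of an IEEE f32 (same sign and 8-bit exponent,
// 7 mantissa bits), so widening it back to f32 is trivial.
static float bf16_to_f32(uint16_t b) {
    uint32_t u = (uint32_t)b << 16;
    float f;
    memcpy(&f, &u, sizeof(f));
    return f;
}

// Simple truncating f32 -> bf16 (real converters usually round to nearest even).
static uint16_t f32_to_bf16(float f) {
    uint32_t u;
    memcpy(&u, &f, sizeof(u));
    return (uint16_t)(u >> 16);
}
```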
ggml supports bf16 tensor operations; this involves a refactor to the type parsing and conversion logic. Note that i64 now converts to f32 instead of i32.
commit 1: add support for using bf16 as a native type - involves a refactor to the type parsing and conversion logic
commit 2: add support for scaled fp8 tensors