Propose to update & upgrade SkyReels-V2 #12167

Open · wants to merge 23 commits into base: main

Conversation


@tolgacangoz tolgacangoz commented Aug 17, 2025

Skywork/SkyReels-V2-DF-1.3B-540P, seed=0

- main: ~14 min. (main.mp4)
- Wan's RoPE (Wan.s_RoPE.mp4)
- Wan's RoPE + compile_repeated_blocks(fullgraph=True): ~12 min. (Wan.s_RoPE+regional.mp4)
- Wan's RoPE + compile_repeated_blocks(fullgraph=True) + "_native_cudnn" for attn1 and "flash" for attn2, FA=2.8.3: ~8 min. (Wan.s_RoPE+regional+FA.mp4)
Reproducer
!uv pip install git+https://github.com/tolgacangoz/diffusers.git@update-skyreels-v2

import torch, os
from diffusers import AutoModel, SkyReelsV2DiffusionForcingPipeline, UniPCMultistepScheduler
from diffusers.utils import export_to_video

# Load model shards in parallel for faster initialization
os.environ["HF_ENABLE_PARALLEL_LOADING"] = "YES"

model_id = "Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers"
vae = AutoModel.from_pretrained(model_id,
                                subfolder="vae",
                                torch_dtype=torch.float32,
                                device_map="cuda")
pipeline = SkyReelsV2DiffusionForcingPipeline.from_pretrained(
    model_id,
    vae=vae,
    torch_dtype=torch.bfloat16,
    device_map="cuda"
)
flow_shift = 8.0  # 8.0 for T2V, 5.0 for I2V
pipeline.scheduler = UniPCMultistepScheduler.from_config(pipeline.scheduler.config, flow_shift=flow_shift)

# Some acceleration helpers
# Be sure to install Flash Attention: https://github.com/Dao-AILab/flash-attention#installation-and-features
#for block in pipeline.transformer.blocks:
#    block.attn1.set_attention_backend("_native_cudnn")
#    block.attn2.set_attention_backend("flash")
#pipeline.transformer.compile_repeated_blocks(fullgraph=True)

prompt = "A cat and a dog baking a cake together in a kitchen. The cat is carefully measuring flour, while the dog is stirring the batter with a wooden spoon. The kitchen is cozy, with sunlight streaming through the window."

output = pipeline(
    prompt=prompt,
    num_inference_steps=30,
    height=544,  # 720 for 720P
    width=960,   # 1280 for 720P
    num_frames=97,
    base_num_frames=97,  # 121 for 720P
    ar_step=5,  # Controls asynchronous inference (0 for synchronous mode)
    causal_block_size=5,  # Number of frames in each block for asynchronous processing
    overlap_history=None,  # Number of frames to overlap for smooth transitions in long videos; 17 for long video generations
    addnoise_condition=20,  # Improves consistency in long video generation
    generator=torch.Generator("cpu").manual_seed(0)
).frames[0]
export_to_video(output, "T2V.mp4", fps=24, quality=8)
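To make the frame-related parameters above concrete, here is a minimal sketch of how `num_frames=97` and `causal_block_size=5` relate, assuming the Wan-style VAE's 4x temporal compression (the 4x factor and the helper names are assumptions for illustration, not diffusers internals):

```python
# Sketch: map pixel-space frame counts to latent frames and causal blocks.
# Assumes a Wan-style VAE with 4x temporal compression (an assumption here).

def latent_frames(num_frames, temporal_compression=4):
    # The first frame is encoded alone; the rest are compressed 4:1.
    return (num_frames - 1) // temporal_compression + 1

def num_causal_blocks(num_frames, causal_block_size):
    # Blocks of latent frames processed together during diffusion forcing.
    return latent_frames(num_frames) // causal_block_size

print(latent_frames(97))         # 97 pixel frames -> 25 latent frames
print(num_causal_blocks(97, 5))  # 25 latent frames -> 5 causal blocks of 5
```

This is why `num_frames=97` pairs cleanly with `causal_block_size=5`: the latent sequence divides evenly into blocks.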
Environment
- 🤗 Diffusers version: 0.35.0 or this branch
- Platform: Linux-4.4.0-x86_64-with-glibc2.36
- Running on Google Colab?: No
- Python version: 3.12.6
- PyTorch version (GPU?): 2.8.0+cu126 (True)
- Flax version (CPU?/GPU?/TPU?): 0.11.0 (gpu)
- Jax version: 0.7.0
- JaxLib version: 0.7.0
- Huggingface_hub version: 0.34.3
- Transformers version: 4.55.0
- Accelerate version: 1.9.0
- PEFT version: not installed
- Bitsandbytes version: not installed
- Safetensors version: 0.6.1
- xFormers version: not installed
- Accelerator: NVIDIA A100-SXM4-40GB, 40960 MiB

@a-r-r-o-w @yiyixuxu @stevhliu

tolgacangoz and others added 12 commits August 17, 2025 22:28
Wraps the visual demonstration section in a Markdown code block.

This change corrects the rendering of ASCII diagrams and examples, improving the overall readability of the document.
Improves the readability of the `step_matrix` examples by replacing long sequences of repeated numbers with a more compact `value×count` notation.

This change makes the underlying data patterns in the examples easier to understand at a glance.
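The value×count compaction described in this commit message can be sketched as follows (a minimal illustration of the notation, not the actual code from the PR):

```python
from itertools import groupby

def compact(seq):
    # Render consecutive runs of equal values as "value×count",
    # e.g. [0, 0, 0, 5, 5] -> "0×3, 5×2".
    return ", ".join(f"{v}×{sum(1 for _ in g)}" for v, g in groupby(seq))

print(compact([0, 0, 0, 5, 5]))  # 0×3, 5×2
```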
@@ -39,79 +40,121 @@
logger = logging.get_logger(__name__) # pylint: disable=invalid-name


class SkyReelsV2AttnProcessor2_0:
def _get_qkv_projections(
Contributor Author
`Copied from` doesn't work here?

@tolgacangoz tolgacangoz marked this pull request as draft August 20, 2025 15:20
@tolgacangoz tolgacangoz changed the title Propose to update SkyReels-V2 Propose to update & upgrade SkyReels-V2 Aug 21, 2025
@tolgacangoz tolgacangoz marked this pull request as ready for review August 21, 2025 12:59
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@stevhliu stevhliu left a comment
Thanks for improving the docs!


Key Pattern: Block i lags behind Block i-1 by exactly ar_step=5 timesteps, creating the
staggered "diffusion forcing" effect where later blocks condition on cleaner earlier blocks.
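The staggered schedule described above can be sketched as follows (illustrative names, not the diffusers `step_matrix` internals):

```python
# Sketch: with ar_step=5, block i starts denoising 5 global steps after
# block i-1, so at any point earlier blocks are always further along
# ("cleaner") than the later blocks that condition on them.

def block_progress(global_step, num_blocks, ar_step=5):
    # Denoising steps each block has completed at a given global step.
    return [max(0, global_step - i * ar_step) for i in range(num_blocks)]

print(block_progress(12, 4))  # [12, 7, 2, 0]: each block lags the previous by 5
```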
```text
Member
Thanks for improving! I think the text block should only be used for the graph and chart visuals.

Contributor Author
I modified it accordingly. I also used backticks for the row representations; otherwise the columns don't appear aligned. How is it now?


## Notes

- SkyReels-V2 supports LoRAs with [`~loaders.SkyReelsV2LoraLoaderMixin.load_lora_weights`].
Member
Why is the LoRA example being removed?

Contributor Author

@tolgacangoz tolgacangoz Aug 22, 2025
This part was copied from Wan's page. Since I didn't test anything with LoRAs, I removed the specific example. But SkyReels-V2 and Wan have almost the same architecture, so I preserved `- SkyReels-V2 supports LoRAs with [~loaders.SkyReelsV2LoraLoaderMixin.load_lora_weights].`

@tolgacangoz tolgacangoz requested a review from stevhliu August 22, 2025 12:03