Skip to content

load model to device when not use tp and fsdp #111

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from
Closed

Conversation

qzzz95
Copy link
Contributor

@qzzz95 qzzz95 commented Jul 4, 2025

在某些特定的机器上,spawn pickle 内存数据时可能内存不对齐,会导致 AVX2 指令集 copy 数据时报错 bus error。
将 tensor 转到 gpu 显存上,避免走 AVX2 数据拷贝,且能节省一次内存间的拷贝

@akaitsuki-ii akaitsuki-ii deleted the fix_bus_error branch August 1, 2025 07:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants