-
Notifications
You must be signed in to change notification settings - Fork 34
[GDPA] add tlx fwd for blackwell #318
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GDPA] add tlx fwd for blackwell #318
Conversation
@manman-ren has imported this pull request. If you are a Meta employee, you can view this in D79301971. |
8f1e6fa
to
08b87f7
Compare
416e38a
to
c04d334
Compare
@manman-ren has imported this pull request. If you are a Meta employee, you can view this in D79301971. |
60e819b
to
826c8b7
Compare
826c8b7
to
b89ba44
Compare
b89ba44
to
a5b8f8a
Compare
7435a0d
to
6296c35
Compare
@manman-ren has imported this pull request. If you are a Meta employee, you can view this in D79301971. |
6296c35
to
43fd2fd
Compare
@manman-ren has imported this pull request. If you are a Meta employee, you can view this in D79301971. |
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: fix Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: fix2 Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: fix Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: update Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: fix Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: fix other issues Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: we can't use tmem_load from a partition with numWarps of 1 Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: comments Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: adjust phase for producer acquire Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: update consumer_v_view Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: update due to changes in local_store/load Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: 1 grid Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: launch failure with 3 buffers seg fault with 2 buffers with device_print runs fine with 2 buffers without device_print Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
43fd2fd
to
179c5e3
Compare
@manman-ren has imported this pull request. If you are a Meta employee, you can view this in D79301971. |
This only works with TLX compiler branch, so enabled is False.
Test with "python run.py --op gdpa --only tlx_gdpa_fwd"