Skip to content

Commit 1e4d28e

Browse files
authored
Merge pull request #374 from Modalities/tensor_parallelism
Tensor parallelism
2 parents f6f663b + dcd37e5 commit 1e4d28e

File tree

13 files changed

+11116
-7
lines changed

13 files changed

+11116
-7
lines changed

CHANGELOG_DEV.md

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -171,3 +171,18 @@ Additionally, I added a script that verifies the consistency of the indexation a
171171
* A wrapper for collate functions to include tokens in the loss which appear between indicator tokens
172172
* A new parameter for the PackedMemMapDatasetContinuous to allow not to re-use the last target token
173173
* A tutorial how to apply instruction-tuning on a Huggingface Model
174+
175+
176+
## PR #359 Activation Checkpoint with FSDP2
177+
178+
This PR adds activation checkpointing (AC) support for FSDP2.
179+
There are now three AC variants:
180+
* Full AC (same as before, where entire complete modules get ACed, leading to the largest memory footprint reduction)
181+
* Selective Layer AC (only very nth layer or module is ACed)
182+
* Selective OP Ac (only certain OPs, typically low memory but compute intense, are checkpointed)
183+
184+
## PR #374 Tensor Parallelism Support
185+
186+
* adds support for Tensor Parallelism (including Sequence Parallelism).
187+
* adds a debugging toolkit to track the input and output tensors during a forward pass, gradients during the backward pass and weight tensors.
188+
Tensors can be either normal Tensors or DTensors.

0 commit comments

Comments
 (0)