Skip to content

Commit 9d26431

Browse files
Lintang Sutawikathomasw21Muennighoff
authored
Mlm adaptation (#287)
Co-authored-by: Lintang Sutawika <[email protected]> Co-authored-by: thomasw21 <[email protected]> Co-authored-by: Niklas Muennighoff <[email protected]>
1 parent 987663c commit 9d26431

File tree

5 files changed

+638
-7
lines changed

5 files changed

+638
-7
lines changed

megatron/arguments.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -925,6 +925,9 @@ def __call__(self, parser, args, values, option_string=None):
925925
'specific positions. This option tries to un-bias the loss by reweighting loss on specific '
926926
'positions based on how frequently we train on that position.'
927927
'This is mostly used for prefix_lm training')
928+
group.add_argument("--noise_density", type=float, default=None, help="Span corruption noise density")
929+
group.add_argument("--mean_noise_span_length", type=int, default=None, help="Span corruption mean noise span length")
930+
928931

929932
return parser
930933

megatron/data/gpt_dataset.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ def build_train_valid_test_datasets(data_prefix, data_impl, splits_string,
3535

3636
# Single dataset.
3737
if len(data_prefix) == 1:
38-
all_train_datasets, all_valid_datasets, all_test_datasets = _build_train_valid_test_datasets(data_prefix[0],
38+
all_train_datasets, all_valid_datasets, all_test_datasets = _build_train_valid_test_datasets(data_prefix[0],
3939
data_impl, splits_string,
4040
train_valid_test_num_samples,
4141
seq_length, seed, skip_warmup)

0 commit comments

Comments
 (0)