-
Notifications
You must be signed in to change notification settings - Fork 539
Add tensorboard to display training and evaluation metrics #3163
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
This pull request was exported from Phabricator. Differential Revision: D77841795 |
This pull request was exported from Phabricator. Differential Revision: D77841795 |
… implementation to support DLRMv2 (pytorch#3163) Summary: Pull Request resolved: pytorch#3163 ### Major changes - Add tensorboard to the benchmark testbed, specifically in `benchmark_zch.py`. - Count the number of unique values received by each rank in each epoch by revising `benchmark_zch_utils.py`. - Revise `data/non_zch_remapper.py` to not depend on `batch.to_dict()` method, instead it fetch dataclass `batch`'s attribute with the built-in `vars()` method. - Revise DLRMv2 model EBC config initialization to make the table name identical with the feature name. Rollback Plan: Differential Revision: D77841795
c96a893
to
f120f53
Compare
This pull request was exported from Phabricator. Differential Revision: D77841795 |
f120f53
to
20ce4f0
Compare
… implementation to support DLRMv2 (pytorch#3163) Summary: Pull Request resolved: pytorch#3163 ### Major changes - Add tensorboard to the benchmark testbed, specifically in `benchmark_zch.py`. - Count the number of unique values received by each rank in each epoch by revising `benchmark_zch_utils.py`. - Revise `data/non_zch_remapper.py` to not depend on `batch.to_dict()` method, instead it fetch dataclass `batch`'s attribute with the built-in `vars()` method. - Revise DLRMv2 model EBC config initialization to make the table name identical with the feature name. - Revise DLRMv2 configuration yaml file to set table size for each feature. - Revise the default value for "num_embeddings" parameter in `arguments.py` to None. Rollback Plan: Differential Revision: D77841795
Summary: Pull Request resolved: pytorch#3127 Differential Revision: D77033290
This pull request was exported from Phabricator. Differential Revision: D77841795 |
20ce4f0
to
38b0865
Compare
… implementation to support DLRMv2 (pytorch#3163) Summary: Pull Request resolved: pytorch#3163 ### Major changes - Add tensorboard to the benchmark testbed, specifically in `benchmark_zch.py`. - Count the number of unique values received by each rank in each epoch by revising `benchmark_zch_utils.py`. - Revise `data/non_zch_remapper.py` to not depend on `batch.to_dict()` method, instead it fetch dataclass `batch`'s attribute with the built-in `vars()` method. - Revise DLRMv2 model EBC config initialization to make the table name identical with the feature name. - Revise DLRMv2 configuration yaml file to set table size for each feature. - Revise the default value for "num_embeddings" parameter in `arguments.py` to None. Rollback Plan: Differential Revision: D77841795
This pull request was exported from Phabricator. Differential Revision: D77841795 |
… implementation to support DLRMv2 (pytorch#3163) Summary: Pull Request resolved: pytorch#3163 ### Major changes - Add tensorboard to the benchmark testbed, specifically in `benchmark_zch.py`. - Count the number of unique values received by each rank in each epoch by revising `benchmark_zch_utils.py`. - Revise `data/non_zch_remapper.py` to not depend on `batch.to_dict()` method, instead it fetch dataclass `batch`'s attribute with the built-in `vars()` method. - Revise DLRMv2 model EBC config initialization to make the table name identical with the feature name. - Revise DLRMv2 configuration yaml file to set table size for each feature. - Revise the default value for "num_embeddings" parameter in `arguments.py` to None. Differential Revision: D77841795
38b0865
to
30f42a3
Compare
… implementation to support DLRMv2 (pytorch#3163) Summary: Pull Request resolved: pytorch#3163 ### Major changes - Add tensorboard to the benchmark testbed, specifically in `benchmark_zch.py`. - Count the number of unique values received by each rank in each epoch by revising `benchmark_zch_utils.py`. - Revise `data/non_zch_remapper.py` to not depend on `batch.to_dict()` method, instead it fetch dataclass `batch`'s attribute with the built-in `vars()` method. - Revise DLRMv2 model EBC config initialization to make the table name identical with the feature name. - Revise DLRMv2 configuration yaml file to set table size for each feature. - Revise the default value for "num_embeddings" parameter in `arguments.py` to None. Differential Revision: D77841795
This pull request was exported from Phabricator. Differential Revision: D77841795 |
30f42a3
to
3ba83dc
Compare
Differential Revision: D77841795