MindSpeed/mindspeed/run/gpt_dataset.patch (MindSpeed: a large-model acceleration library for Ascend devices)

Ii-robot!716 perf: gpt_dataset and initialize in megatron

afc5fe19, created on August 28, 2024

diff --git a/megatron/core/datasets/gpt_dataset.py b/megatron/core/datasets/gpt_dataset.py
index a645f89..7de00b7 100644
--- a/megatron/core/datasets/gpt_dataset.py
+++ b/megatron/core/datasets/gpt_dataset.py
@@ -340,9 +340,11 @@ class GPTDataset(MegatronDataset):
         else:
             cache_hit = False
 
+        from megatron.training import get_args
+        args = get_args()
         if not path_to_cache or (
             not cache_hit
-            and (not torch.distributed.is_initialized() or torch.distributed.get_rank() == 0)
+            and (not torch.distributed.is_initialized() or torch.distributed.get_rank() % args.tensor_model_parallel_size == 0)
         ):
 
             log_single_rank(
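
The effect of the changed condition can be checked without a distributed run. The sketch below restates the patched predicate as a plain function, with the `torch.distributed` calls and `args.tensor_model_parallel_size` replaced by ordinary arguments. The helper names `should_build_indices` and `building_ranks` are hypothetical and appear in neither Megatron nor MindSpeed. It also assumes that tensor-parallel ranks are numbered contiguously, so that `rank % tensor_model_parallel_size == 0` picks the first rank of each tensor-parallel group; that depends on how the process groups are laid out and is not stated in the patch.

```python
from typing import List, Optional


def should_build_indices(
    path_to_cache: Optional[str],
    cache_hit: bool,
    distributed_initialized: bool,
    rank: int,
    tensor_model_parallel_size: int,
) -> bool:
    """Plain-Python restatement of the patched condition (hypothetical helper).

    Before the patch the last clause was `rank == 0`; after the patch it is
    `rank % tensor_model_parallel_size == 0`.
    """
    return bool(
        not path_to_cache
        or (
            not cache_hit
            and (
                not distributed_initialized
                or rank % tensor_model_parallel_size == 0
            )
        )
    )


def building_ranks(
    world_size: int,
    tensor_model_parallel_size: int,
    cache_hit: bool = False,
    path_to_cache: Optional[str] = "/tmp/index_cache",  # placeholder path
) -> List[int]:
    """List the ranks for which the condition is true (hypothetical helper)."""
    return [
        rank
        for rank in range(world_size)
        if should_build_indices(
            path_to_cache, cache_hit, True, rank, tensor_model_parallel_size
        )
    ]


if __name__ == "__main__":
    # 8 ranks, tensor parallel size 4: one rank per group of 4 enters the branch.
    print(building_ranks(world_size=8, tensor_model_parallel_size=4))
```

With a tensor parallel size of 1 the modulo is always 0, so every rank enters the branch on a cache miss, whereas the original `get_rank() == 0` check admitted only rank 0.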