File | Last commit | Last updated
!2180 refactor: hccl buffer adaptive. Merge pull request !2180 from 刘哲续/master. 1 year ago
fix hccl buffer errors for verl cases (merge request Ascend/MindSpeed!3478, merges fix_hccl_buffer_for_verl into master).
Co-authored-by: quancs001 <quancs@qq.com>. Created-by: quancs001. Commit-by: quancs001. Merged-by: ascend-robot.
What this PR does / why we need it: running RL experiments with verl raises several errors, e.g.:
1. megatron.training.get_args raises an exception.
2. hccl_buffer arguments containing ";" cannot be parsed by hydra, which raises LexerNoViableAltException.
This PR fixes these errors.
Does this PR introduce any user-facing change? No.
How was this patch tested? The code was tested and verified locally.
15 days ago