File    Last commit    Last updated
!2717 moe router feature patch refactor Merge pull request !2717 from mhh001/master_0527 11 months ago
[pytorch][feature] Replace reset_position_ids with reset_attention_mask, and enable ring attention support when reset_attention_mask is active. Co-authored-by: mhh001<mahonghao1@huawei.com> # message auto-generated for no-merge-commit merge: !3506 merge master into master [pytorch][feature] Replace reset_position_ids with reset_attention_mask, and enable ring attention support when reset_attention_mask is active. Created-by: mhh111 Commit-by: mhh001 Merged-by: ascend-robot Description: ring CP supports reset-attention-mask in the pack/neat-pack scenario, aligned with Megatron's support for the fixed-length scenario. See merge request: Ascend/MindSpeed-LLM!3506 6 months ago
!3232 [pytorh][refactor]refactor tp-2d Merge pull request !3232 from jwhk/master 8 months ago
[pytorch][feature]PLM-1.8B pretrain/sft Co-authored-by: EVA1<jingsiyu1@huawei.com> # message auto-generated for no-merge-commit merge: !3637 merge master into master [pytorch][feature]PLM-1.8B pretrain/sft Created-by: EVA1 Commit-by: EVA1 Merged-by: ascend-robot Description: 1. PLM-1.8B model support: dataset format conversion, weight conversion, fine-tuning, and pretraining; 2. Precision is aligned, with an SFT relative error below one part in a thousand. See merge request: Ascend/MindSpeed-LLM!3637 6 months ago
!3178 [pytorch][refactor]add qlora feature patch Merge pull request !3178 from 丁子叉/master 9 months ago