文件最后提交记录最后更新时间
fix: different scale for mla in master Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> # message auto-generated for no-merge-commit merge: !2977 merge fixbug_mla_master into master fix: different scale for mla in master Created-by: wuweiqiang24 Commit-by: wuweiqiang24 Merged-by: ascend-robot Description: 与magatron中mla内的scale参数计算不一致,导致精度无法对齐 See merge request: Ascend/MindSpeed!29776 个月前
add dsa indexer triton impl Co-authored-by: hx_v5<huangxin115@huawei.com> # message auto-generated for no-merge-commit merge: !3049 merge triton_dsa_indexer into master add dsa indexer triton impl Created-by: hx_v5 Commit-by: hx_v5 Merged-by: ascend-robot Description: add dsa indexer triton impl See merge request: Ascend/MindSpeed!30496 个月前
fix:fix atten_mask_shape error when using transformer_engine Co-authored-by: Keilo_W<wangkaiyu11@h-partners.com> # message auto-generated for no-merge-commit merge: !3293 merge master into master fix:fix atten_mask_shape error when using transformer_engine Created-by: Keilo_W Commit-by: Keilo_W Merged-by: ascend-robot Description: An atten_mask_shape error will occur if --attention-mask-type causal is used together with --transformer-impl transformer_engine. To avoid this, you must also enable the --use-flash-attn option. See merge request: Ascend/MindSpeed!32932 个月前
fix(torch/cp): use sbnd format before all2all Co-authored-by: clc2025<chenlucong@huawei.com> # message auto-generated for no-merge-commit merge: !3282 merge fixbug_ulysses_tnd into master fix(torch/cp): use sbnd format before all2all Created-by: clc2025 Commit-by: clc2025 Merged-by: ascend-robot Description: fixbug for ulysses tnd See merge request: Ascend/MindSpeed!32823 个月前
!2117 refactor:generate mask & ailibi pse Merge pull request !2117 from 范文焘/master 1 年前