File | Last commit | Last updated
!438 nanopipe pipeline parallelism, reducing pipeline bubbles Merge pull request !438 from liujianxing/dev_nanopipe_0618 1 year ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
Low-precision optimizer: add README Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> # message auto-generated for no-merge-commit merge: !3067 merge master into master Low-precision optimizer: add README Created-by: w30064656 Commit-by: w30064656 Merged-by: ascend-robot Description: add README, fix bug See merge request: Ascend/MindSpeed!3067 5 months ago
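!1804 Adaptive memory optimization Merge pull request !1804 from huangzhenyu/dev_adaptive_mem_opt_630 1 year ago
!30 Develop the adaptive selective recomputation feature Merge pull request !30 from zengyihang/master 2 years ago
!30 Develop the adaptive selective recomputation feature Merge pull request !30 from zengyihang/master 2 years ago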
AI Qos Feature Co-authored-by: EX_mitsuX<yangjie409@h-partners.com> Co-authored-by: ascend-robot<zhongyuanke@huawei.com> Co-authored-by: Klayyy<wanglei886@h-partners.com> Co-authored-by: LinMingZhe<linmingzhe3@huawei.com> Co-authored-by: clc2025<chenlucong@huawei.com> Co-authored-by: JialiZheng1<jializheng@huawei.com> Co-authored-by: MissingPompeii<guohao120@huawei.com> Co-authored-by: wangjinyi6<wangjinyi6@huawei.com> Co-authored-by: Klayyy<Klayyy@noreply.gitcode.com> Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> Co-authored-by: xubin787<mark19980312@126.com> Co-authored-by: libaokui<libaokui@huawei.com> Co-authored-by: LinShua<707894133@qq.com> Co-authored-by: guihaowen666<guihaowen@huawei.com> Co-authored-by: ChenDonYY<caichendong2@huawei.com> Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> Co-authored-by: Muuyo<koimuu@163.com> Co-authored-by: ybwang19<1605891897@qq.com> Co-authored-by: lmztju<limingzhao3@h-partners.com> Co-authored-by: xuyujun<xuyujun5@hisilicon.com> Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !3161 merge qos into master feat:AI QoS Feature Created-by: Klayyy Commit-by: Klayyy;Jia_Austin;libaokui;EX_mitsuX;wuweiqiang24;Muuyo;xuyujun;lmztju;wangjinyi6;ascend-robot;ybwang19;xubin787;ChenDonYY;guihaowen666;clc2025;LinShua;w30064656;MissingPompeii;JialiZheng1;LinMingZhe Merged-by: ascend-robot Description: AI QoS Feature See merge request: Ascend/MindSpeed!3161 4 months ago
AI Qos Feature Co-authored-by: EX_mitsuX<yangjie409@h-partners.com> Co-authored-by: ascend-robot<zhongyuanke@huawei.com> Co-authored-by: Klayyy<wanglei886@h-partners.com> Co-authored-by: LinMingZhe<linmingzhe3@huawei.com> Co-authored-by: clc2025<chenlucong@huawei.com> Co-authored-by: JialiZheng1<jializheng@huawei.com> Co-authored-by: MissingPompeii<guohao120@huawei.com> Co-authored-by: wangjinyi6<wangjinyi6@huawei.com> Co-authored-by: Klayyy<Klayyy@noreply.gitcode.com> Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> Co-authored-by: xubin787<mark19980312@126.com> Co-authored-by: libaokui<libaokui@huawei.com> Co-authored-by: LinShua<707894133@qq.com> Co-authored-by: guihaowen666<guihaowen@huawei.com> Co-authored-by: ChenDonYY<caichendong2@huawei.com> Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> Co-authored-by: Muuyo<koimuu@163.com> Co-authored-by: ybwang19<1605891897@qq.com> Co-authored-by: lmztju<limingzhao3@h-partners.com> Co-authored-by: xuyujun<xuyujun5@hisilicon.com> Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !3161 merge qos into master feat:AI QoS Feature Created-by: Klayyy Commit-by: Klayyy;Jia_Austin;libaokui;EX_mitsuX;wuweiqiang24;Muuyo;xuyujun;lmztju;wangjinyi6;ascend-robot;ybwang19;xubin787;ChenDonYY;guihaowen666;clc2025;LinShua;w30064656;MissingPompeii;JialiZheng1;LinMingZhe Merged-by: ascend-robot Description: AI QoS Feature See merge request: Ascend/MindSpeed!3161 4 months ago
AI Qos Feature Co-authored-by: EX_mitsuX<yangjie409@h-partners.com> Co-authored-by: ascend-robot<zhongyuanke@huawei.com> Co-authored-by: Klayyy<wanglei886@h-partners.com> Co-authored-by: LinMingZhe<linmingzhe3@huawei.com> Co-authored-by: clc2025<chenlucong@huawei.com> Co-authored-by: JialiZheng1<jializheng@huawei.com> Co-authored-by: MissingPompeii<guohao120@huawei.com> Co-authored-by: wangjinyi6<wangjinyi6@huawei.com> Co-authored-by: Klayyy<Klayyy@noreply.gitcode.com> Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> Co-authored-by: xubin787<mark19980312@126.com> Co-authored-by: libaokui<libaokui@huawei.com> Co-authored-by: LinShua<707894133@qq.com> Co-authored-by: guihaowen666<guihaowen@huawei.com> Co-authored-by: ChenDonYY<caichendong2@huawei.com> Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> Co-authored-by: Muuyo<koimuu@163.com> Co-authored-by: ybwang19<1605891897@qq.com> Co-authored-by: lmztju<limingzhao3@h-partners.com> Co-authored-by: xuyujun<xuyujun5@hisilicon.com> Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !3161 merge qos into master feat:AI QoS Feature Created-by: Klayyy Commit-by: Klayyy;Jia_Austin;libaokui;EX_mitsuX;wuweiqiang24;Muuyo;xuyujun;lmztju;wangjinyi6;ascend-robot;ybwang19;xubin787;ChenDonYY;guihaowen666;clc2025;LinShua;w30064656;MissingPompeii;JialiZheng1;LinMingZhe Merged-by: ascend-robot Description: AI QoS Feature See merge request: Ascend/MindSpeed!3161 4 months ago
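!58 Submit the TP recomputation communication optimization algorithm Merge pull request !58 from Kingsleyandher/master 2 years ago
!58 Submit the TP recomputation communication optimization algorithm Merge pull request !58 from Kingsleyandher/master 2 years ago
!242 [readme] Add readme for async DDP, Alibi, and the swiglu fused operator Merge pull request !242 from 赵一帆/zyf_1 2 years ago
!632 Add the Ampipe pipelined communication hiding feature Merge pull request !632 from 张树仁/ampipe 1 year ago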
Add offline pad_data Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> # message auto-generated for no-merge-commit merge: !2938 merge revise_preprocess_data into master Add offline pad_data Created-by: wuweiqiang24 Commit-by: wuweiqiang24 Merged-by: ascend-robot Description: Add offline preprocessing for pack datasets: data can be padded in advance to a multiple of 2\*CP, which saves the padding time when CP is used online * Accuracy differs somewhat from the non-offline padding version ![2.png](https://raw.gitcode.com/user-images/assets/7404741/07e65a36-a1cd-4f79-ab62-832febdfa052/2.png '2.png') * Llama2-7b, single node, 16k, GBS=8: 4.8% performance improvement ![性能提升.png](https://raw.gitcode.com/user-images/assets/7404741/e38f0a9b-e0b2-498a-a42b-c8b59ec05e87/性能提升.png '性能提升.png') See merge request: Ascend/MindSpeed!2938 6 months ago
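!242 [readme] Add readme for async DDP, Alibi, and the swiglu fused operator Merge pull request !242 from 赵一帆/zyf_1 2 years ago
!263 [Add readme] On flash attention and weight-update communication hiding Merge pull request !263 from Wang Xiaochao/master 2 years ago
!263 [Add readme] On flash attention and weight-update communication hiding Merge pull request !263 from Wang Xiaochao/master 2 years ago
!263 [Add readme] On flash attention and weight-update communication hiding Merge pull request !263 from Wang Xiaochao/master 2 years ago
!583 Submit Flex Parallel: code implementation of the automatic search algorithm for multi-dimensional parallel configurations Merge pull request !583 from robert/master 1 year ago
!583 Submit Flex Parallel: code implementation of the automatic search algorithm for multi-dimensional parallel configurations Merge pull request !583 from robert/master 1 year ago
!516 Submit the code implementation of the PP auto-parallel algorithm Merge pull request !516 from gitee-yy/master 1 year ago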
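feat: iterative upgrade of the memory compression feature Co-authored-by: NingGuangyou<ningguangyou@h-partners.com> # message auto-generated for no-merge-commit merge: !3173 merge master into master feat: iterative upgrade of the memory compression feature Created-by: NingGuangyou Commit-by: NingGuangyou Merged-by: ascend-robot Description: This PR extends the original feature, which compressed activations of the MLP module on the first node, to per-transformer-layer activation compression on every node plus compression of the AdamW first- and second-order moments. The new functionality is added while the original feature and its usage remain unchanged. See the readme for details. See merge request: Ascend/MindSpeed!3173 4 months ago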
!2206 feat: dense layer activation compression -core_r0.10.0 Merge pull request !2206 from 抄小抄/master 1 year ago
!1848 add description for conv3d sequence parallel Merge pull request !1848 from wangyuansheng8/master 1 year ago
!2391 dualpipev Merge pull request !2391 from 赵一帆/master 11 months ago
!2391 dualpipev Merge pull request !2391 from 赵一帆/master 11 months ago
!2391 dualpipev Merge pull request !2391 from 赵一帆/master 11 months ago
Add MOE expert load balancing Co-authored-by: zhanggaolu2<252028123@qq.com> # message auto-generated for no-merge-commit merge: !2845 merge expert_loadbalance2master into master Add MOE expert load balancing Created-by: zhanggaolu2 Commit-by: zhanggaolu2 Merged-by: ascend-robot Description: Add MOE expert load balancing See merge request: Ascend/MindSpeed!2845 7 months ago
Add MOE expert load balancing Co-authored-by: zhanggaolu2<252028123@qq.com> # message auto-generated for no-merge-commit merge: !2845 merge expert_loadbalance2master into master Add MOE expert load balancing Created-by: zhanggaolu2 Commit-by: zhanggaolu2 Merged-by: ascend-robot Description: Add MOE expert load balancing See merge request: Ascend/MindSpeed!2845 7 months ago
!2408 [bugfix][master] moe-fb-overlap: fix noop-layers compatibility issue Merge pull request !2408 from yangkai/fb-overlap-fix-noop 11 months ago
!2408 [bugfix][master] moe-fb-overlap: fix noop-layers compatibility issue Merge pull request !2408 from yangkai/fb-overlap-fix-noop 11 months ago
!2408 [bugfix][master] moe-fb-overlap: fix noop-layers compatibility issue Merge pull request !2408 from yangkai/fb-overlap-fix-noop 11 months ago
!263 [Add readme] On flash attention and weight-update communication hiding Merge pull request !263 from Wang Xiaochao/master 2 years ago
Support TransformerEngine Co-authored-by: MingzhenWang<wangmingzhen4@huawei.com> Co-authored-by: Muu<koimuu@163.com> Co-authored-by: x30061065<xuyuanhui3@h-partners.com> Co-authored-by: 耿瑞良<gengruiliang@huawei.com> # message auto-generated for no-merge-commit merge: !2947 merge lingqu_master into master Support TransformerEngine Created-by: mingzhenwang Commit-by: mingzhenwang;Muu;MingzhenWang;x30061065;耿瑞良 Merged-by: ascend-robot Description: 1. Support the TELinear layer 2. Support FP8 computation, quantmatmul/gmm 3. Support multiple data types, FP8/HiF8 4. Support multiple quantization strategies, delayed/tensorwise/blockwise/mxfp8 5. The TELinear layer supports communication-computation fusion See merge request: Ascend/MindSpeed!2947 6 months ago
!2335 docs: add the migration guide for the distributed training acceleration library Merge pull request !2335 from liurong1995/feature_docs 1 year ago
initial 2 years ago
Low-precision optimizer: add README Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> # message auto-generated for no-merge-commit merge: !3067 merge master into master Low-precision optimizer: add README Created-by: w30064656 Commit-by: w30064656 Merged-by: ascend-robot Description: add README, fix bug See merge request: Ascend/MindSpeed!3067 5 months ago
Low-precision optimizer: add README Co-authored-by: w30064656<wangzhuangzhuang8@h-partners.com> # message auto-generated for no-merge-commit merge: !3067 merge master into master Low-precision optimizer: add README Created-by: w30064656 Commit-by: w30064656 Merged-by: ascend-robot Description: add README, fix bug See merge request: Ascend/MindSpeed!3067 5 months ago
!456 Submit the ND_MatMul algorithm code Merge pull request !456 from robert/master 1 year ago
!2546 bugfix: fix error of model-migration.md Merge pull request !2546 from liurong1995/bugfix_docs 10 months ago
!2597 bugfix: fix error of mermaid preview Merge pull request !2597 from liurong1995/bugfix_docs 10 months ago
!269 Efficient-MoE: MoE token dropless performance optimization Merge pull request !269 from Kingsleyandher/master 2 years ago
!269 Efficient-MoE: MoE token dropless performance optimization Merge pull request !269 from Kingsleyandher/master 2 years ago
!269 Efficient-MoE: MoE token dropless performance optimization Merge pull request !269 from Kingsleyandher/master 2 years ago
!269 Efficient-MoE: MoE token dropless performance optimization Merge pull request !269 from Kingsleyandher/master 2 years ago
!269 Efficient-MoE: MoE token dropless performance optimization Merge pull request !269 from Kingsleyandher/master 2 years ago
!1399 tp-2d: adapt to num_query_group + free optimization (master) Merge pull request !1399 from liujianxing/tp_2d_query_group_2 1 year ago
!1399 tp-2d: adapt to num_query_group + free optimization (master) Merge pull request !1399 from liujianxing/tp_2d_query_group_2 1 year ago
!456 Submit the ND_MatMul algorithm code Merge pull request !456 from robert/master 1 year ago
!229 Add documentation Merge pull request !229 from yuqi/master 2 years ago
!375 moe performance optimization: all2all mlp communication hiding Merge pull request !375 from xiangjunwei/master_0530 1 year ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
!482 Independent pipeline scheduling for recomputation Merge pull request !482 from huangzhenyu/ripipe-latest 1 year ago
!482 Independent pipeline scheduling for recomputation Merge pull request !482 from huangzhenyu/ripipe-latest 1 year ago
!254 Add sequence-parallel readme Merge pull request !254 from 李宝奎/master 2 years ago
!1326 [master]: add shared-expert documentation and MLA test script Merge pull request !1326 from Wang Xiaochao/master 1 year ago
!1922 add smart_swap test Merge pull request !1922 from ChenDonYY/master_test 1 year ago
!2341 add swap optimizer to core_r0.12.0 Merge pull request !2341 from wangyuansheng8/master 11 months ago
!570 Update swap_attention documentation Merge pull request !570 from wuxue15/swap_attention_readme 1 year ago
!570 Update swap_attention documentation Merge pull request !570 from wuxue15/swap_attention_readme 1 year ago
!570 Update swap_attention documentation Merge pull request !570 from wuxue15/swap_attention_readme 1 year ago
!570 Update swap_attention documentation Merge pull request !570 from wuxue15/swap_attention_readme 1 year ago
!699 2D tensor parallelism Merge pull request !699 from liujianxing/2d_tensor_0824 1 year ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
!2593 Readme update! Merge pull request !2593 from yangjie/master 10 months ago
!2264 fix: virtual optimizer bug fix and update doc Merge pull request !2264 from Kingsleyandher/master 1 year ago
!229 Add documentation Merge pull request !229 from yuqi/master 2 years ago
!2408 [bugfix][master] moe-fb-overlap: fix noop-layers compatibility issue Merge pull request !2408 from yangkai/fb-overlap-fix-noop 11 months ago