文件最后提交记录最后更新时间
feat: Optimize deepseekV4's rmsnorm operator performance Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !4553 merge master_rmsnorm_ascendC into master feat: Optimize deepseekV4's rmsnorm operator performance Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? 优化deepseekV4's rmsnorm性能,调用融合算子 ## Does this PR introduce any user-facing change? NA ## How was this patch tested? NA See merge request: Ascend/MindSpeed-LLM!45531 天前
!2030 CodeChcek整改-master Merge pull request !2030 from shenjiarun/master 1 年前
!2089 refactor: move spec related structure into right position Merge pull request !2089 from RuanZhiXiang/refactor_mla_attention 1 年前
style: Adjust the reference path of the GDN triton operator. Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !4416 merge master_GDN_stype into master style: Adjust the reference path of the GDN triton operator. Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? 更新GDN triton算子的引用路径 ## Does this PR introduce any user-facing change? NA ## How was this patch tested? mindspeed仓库GDN triton算子路径调整,更新引用路径,已有相关测试用例 See merge request: Ascend/MindSpeed-LLM!44161 个月前
feat:Added CP feature for deepseek v4 Co-authored-by: sunjunjie1587<sunjunjie8@huawei.com> # message auto-generated for no-merge-commit merge: !4503 merge master into master feat:Added CP feature for deepseek v4 Created-by: sunjunjie1587 Commit-by: sunjunjie1587 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Added CP feature for deepseek v4 ## Does this PR introduce any user-facing change? support dsv4 long context training ## How was this patch tested? test 16k/8k case, See merge request: Ascend/MindSpeed-LLM!45034 天前
!1958 整改仓库文件结构 Merge pull request !1958 from DONGHAORAN/master 1 年前
feat(pytorch): FSDP2 support GDN cp Co-authored-by: mhh111<mahonghao1@huawei.com> # message auto-generated for no-merge-commit merge: !4510 merge 0522_1 into master feat(pytorch): FSDP2 support GDN cp Created-by: mhh111 Commit-by: mhh111 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!45106 天前
[pytorch][bugfix]fix about cleancode Co-authored-by: qyzqyz<quyueze@h-partners.com> 7 个月前
!2249 【HunyuanLargeMoE】part of model Merge pull request !2249 from zhoubeirong/0218-model-part1 1 年前
style(pytorch): SCA compliance rectification Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4355 merge master into master style(pytorch): SCA compliance rectification Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? SCA compliance rectification ## Does this PR introduce any user-facing change? no ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43552 个月前
add TECPDotproductAttention Module Co-authored-by: 乌兰娜仁<wulannarenzhao@gmail.com> # message auto-generated for no-merge-commit merge: !4170 merge add_TECPDotProductAttn into master add TECPDotproductAttention Module Created-by: hid88941705 Commit-by: 乌兰娜仁 Merged-by: ascend-robot Description: add TECPDotproductAttention Module See merge request: Ascend/MindSpeed-LLM!41702 个月前
fix(pytorch): fix duplicate transports in mla and rope Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4309 merge bugfix2 into master fix(pytorch): fix duplicate transports in mla and rope Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? fix duplicate transports in mla and rope ## Does this PR introduce any user-facing change? no ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43092 个月前
fix(pytorch): fix duplicate transports in mla and rope Co-authored-by: zhyebin01<zhangyebin@h-partners.com> # message auto-generated for no-merge-commit merge: !4309 merge bugfix2 into master fix(pytorch): fix duplicate transports in mla and rope Created-by: zhyebin01 Commit-by: zhyebin01 Merged-by: ascend-robot Description: ## What this PR does / why we need it? fix duplicate transports in mla and rope ## Does this PR introduce any user-facing change? no ## How was this patch tested? pipeline test passed See merge request: Ascend/MindSpeed-LLM!43092 个月前
!3316 [pytorch][model]add qwen3_next model Merge pull request !3316 from guozhihua/qwen3_next_master 8 个月前
[pytorch][feature]add verification for TP and CP in qwen3-next,and install triton-ascend. Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !4098 merge master_qwen3_next_arg into master [pytorch][feature]add verification for TP and CP in qwen3-next,and install triton-ascend. Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: add verification for TP and CP in qwen3-next,and install triton-ascend. See merge request: Ascend/MindSpeed-LLM!40984 个月前
fix(pytorch):fix pipeline testcase Co-authored-by: yanzhixiao<yanzhixiao@h-partners.com> # message auto-generated for no-merge-commit merge: !4443 merge bugfix-pipeline into master fix(pytorch):fix pipeline testcase Created-by: yanzhixiao23 Commit-by: yanzhixiao Merged-by: ascend-robot Description: ## What this PR does / why we need it? fix pipeline testcase ## Does this PR introduce any user-facing change? NA ## How was this patch tested? NA See merge request: Ascend/MindSpeed-LLM!444326 天前
!3025 [pytorch][bugfix] fix the hunyuan model Merge pull request !3025 from yanzhixiao/bugfix-hunyuan 10 个月前
[pytorch][model]longcat-flash-chat development Co-authored-by: guihaowen666<guihaowen@huawei.com> # message auto-generated for no-merge-commit merge: !4012 merge br_master_longcat_model_dev into master [pytorch][model]longcat-flash-chat development Created-by: guihaowen666 Commit-by: guihaowen666 Merged-by: ascend-robot Description: ckpt convert develop See merge request: Ascend/MindSpeed-LLM!40123 个月前