文件最后提交记录最后更新时间
!3171 [pytorch][feature]Deprecate some moe parameters: n-group, topk-group, etc. Merge pull request !3171 from shengjy/dep_moe 9 个月前
refactor(pytorch): update deepseek4 shell Co-authored-by: dingzicha1997<dingzilin@huawei.com> # message auto-generated for no-merge-commit merge: !4423 merge master into master refactor(pytorch): update deepseek4 shell Created-by: dingzicha1997 Commit-by: dingzicha1997 Merged-by: ascend-robot Description: ## What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. ## Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. ## How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed-LLM!44231 个月前
!3370 [pytorch][bugfix]restore the einsum operation for next states of mamba Merge pull request !3370 from sunjunjie/master 8 个月前
feat: Optimize deepseekV4's rmsnorm operator performance Co-authored-by: LinShua<707894133@qq.com> # message auto-generated for no-merge-commit merge: !4553 merge master_rmsnorm_ascendC into master feat: Optimize deepseekV4's rmsnorm operator performance Created-by: LinShua Commit-by: LinShua Merged-by: ascend-robot Description: ## What this PR does / why we need it? 优化deepseekV4's rmsnorm性能,调用融合算子 ## Does this PR introduce any user-facing change? NA ## How was this patch tested? NA See merge request: Ascend/MindSpeed-LLM!45531 天前
!1998 rename: repo package name from modellink to mindspeed_llm Merge pull request !1998 from MeiFei/master-package-rename 1 年前