文件最后提交记录最后更新时间
Feat: adaptor for DeepSeek V4 Co-authored-by: wuweiqiang24<wuweiqiang11@huawei.com> # message auto-generated for no-merge-commit merge: !3427 merge master into master Feat: adaptor for DeepSeek V4 Created-by: wuweiqiang24 Commit-by: wuweiqiang24 Merged-by: ascend-robot Description: What this PR does / why we need it? Adaptor for DeepSeek V4!!! Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed!34271 个月前
refactor: rename --sub-seq-length to --fix-sub-seq-length for clarity Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !3231 merge sub_seq into master refactor: rename --sub-seq-length to --fix-sub-seq-length for clarity Created-by: Jia_Austin Commit-by: Jia_Austin Merged-by: ascend-robot Description: refactor: rename --sub-seq-length to --fix-sub-seq-length for clarity See merge request: Ascend/MindSpeed!32314 个月前
!2650 Add TE class and args modification to support verl Merge pull request !2650 from Jializheng/args 10 个月前
!2308 Adaptation core_r0.12.0 Merge pull request !2308 from 邓佳/core_r0.12.0_dev 1 年前
!2117 refactor:generate mask & ailibi pse Merge pull request !2117 from 范文焘/master 1 年前