File | Last commit | Last updated
update core0.12.1 Co-authored-by: daikang_123<daikang6@huawei.com> # message auto-generated for no-merge-commit merge: !671 merge master into master update core0.12.1 Created-by: daikang123 Commit-by: daikang123;daikang_123 Merged-by: ascend-robot Description: Validation results from long-running runs of other models: https://wiki.huawei.com/domains/6995/wiki/8/WIKI202510148565350 See merge request: Ascend/MindSpeed-RL!671 | 7 months ago
fix grpo loss&reward Co-authored-by: may_feimei<meifei5@huawei.com> # message auto-generated for no-merge-commit merge: merge master into master fix grpo loss&reward Created-by: may_feimei Commit-by: may_feimei Merged-by: ascend-robot Description: Fix training collapse See merge request: Ascend/MindSpeed-RL!669 | 7 months ago
!490 security fix Merge pull request !490 from TJJ/master | 9 months ago
update core0.12.1 Co-authored-by: daikang_123<daikang6@huawei.com> # message auto-generated for no-merge-commit merge: !671 merge master into master update core0.12.1 Created-by: daikang123 Commit-by: daikang123;daikang_123 Merged-by: ascend-robot Description: Validation results from long-running runs of other models: https://wiki.huawei.com/domains/6995/wiki/8/WIKI202510148565350 See merge request: Ascend/MindSpeed-RL!671 | 7 months ago
update core0.12.1 Co-authored-by: daikang_123<daikang6@huawei.com> # message auto-generated for no-merge-commit merge: !671 merge master into master update core0.12.1 Created-by: daikang123 Commit-by: daikang123;daikang_123 Merged-by: ascend-robot Description: Validation results from long-running runs of other models: https://wiki.huawei.com/domains/6995/wiki/8/WIKI202510148565350 See merge request: Ascend/MindSpeed-RL!671 | 7 months ago
!489 Support DAPO for the qwen3 32b model Merge pull request !489 from xiecheng/master | 10 months ago
!474 grpo qwen3-8b A3 Merge pull request !474 from 戴康123/master | 10 months ago
Support search tool Co-authored-by: pengnuoheng<pengnuoheng@huawei.com> # message auto-generated for no-merge-commit merge: !846 merge master into master Support search tool Created-by: pengnuoheng Commit-by: pengnuoheng Merged-by: ascend-robot Description: Support search tool. Self-verification report: https://wiki.huawei.com/domains/127887/wiki/246533/WIKI202512179456276 See merge request: Ascend/MindSpeed-RL!846 | 5 months ago