| add pynative trainer.
| 4 个月前 |
| [master][bugfix] Delete the validation of the existing context instance
| 2 个月前 |
| tensorboard extension
| 1 年前 |
| revert del llama3_1
| 6 个月前 |
| [master][bugfix] Rm TestInferParallel testcase
| 10 个月前 |
| [Bugfix] Fixed the issue of test cases in test_deprecation_logs.py reporting errors in py3.12
| 2 个月前 |
| 【master】【bugfix】fall infer test cases
| 18 天前 |
| [master][bugfix] Delete the validation of the existing context instance
| 2 个月前 |
| 【master】【pynative】【ut】补充机间通信合并用例
| 1 天前 |
| 更新Muon优化器测试基线数据
- 更新BASELINE_LOSSES_NESTEROV_TRUE基线数据
- 更新BASELINE_LOSSES_NESTEROV_FALSE基线数据
- 更新BASELINE_LOSSES_DIFF_LR基线数据
- 所有测试用例已通过验证
【master】【bugfix】generator pybind changes seed and offset from parameter into tensor
fix dropout shard
fix dropout shard
fix dropout shard
| 4 个月前 |
| del mixtral
| 6 个月前 |
| 修改resume_utils用例中生成文件的时间戳,保证文件先后顺序准确。
| 9 个月前 |
| 增加权重2.0保存路径保存权重新逻辑:keep_checkpoint_max仅对当前轮次训练有效,不改变过去轮次训练已保存的权重,当出现同名权重,将会直接覆盖
| 2 个月前 |
| [master][bugfix] Delete the validation of the existing context instance
| 2 个月前 |
| 【master】【pynative】【ut】补充recompute、swap接口测试用例
| 1 天前 |
| test9
| 3 年前 |
| 【master】【feature】add current rank print message of load_checkpoint
| 3 个月前 |
| fix testcases.
| 8 个月前 |
| Revert "!6313 下架obs下载链接"
This reverts commit cee191cb521d98ae68ba4b9668a0b0ca53429a05, reversing
changes made to 498dbf7bf2bd756a21b246befecbd98dbff8348a.
| 11 个月前 |
| 增加transform_checkpoint_utils.py测试用例
| 5 个月前 |
| [Models] Delete Llama2
| 11 个月前 |
| 【master】【bugfix】增加dualpipev精度level1用例
| 8 个月前 |