| feat(torch):Support text-only pretraining
Co-authored-by: yaoyaoxu<xuyaoyao.824404@huawei.com>
# message auto-generated for no-merge-commit merge:
!2261 merge pretrain_data_preprocess into master
feat(torch):Support text-only pretraining
Created-by: yaoyaoxu
Commit-by: yaoyaoxu
Merged-by: ascend-robot
Description: ## What this PR does / why we need it?
1.支持纯fsdp的纯文本预训练
2.支持megatron+fsdp双后端的纯文本预训练
3.提供预训练特性文档
## Does this PR introduce any user-facing change?
Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path.
## How was this patch tested?
Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations.
See merge request: Ascend/MindSpeed-MM!2261 | 2 个月前 |