MindSpeed-LLM/docs · Ascend/MindSpeed-LLM - AtomGit

ascend-robot[pytorch][doc]update TND doc

文件	最后提交记录	最后更新时间
mindspore	[mindspore][bugfix] fix the bug of mindspore version Co-authored-by: weixin_47897441<wuyouqi1@h-partners.com> # message auto-generated for no-merge-commit merge: !4244 merge 2.2.0 into 2.2.0 [mindspore][bugfix] fix the bug of mindspore version Created-by: weixin_47897441 Commit-by: weixin_47897441 Merged-by: ascend-robot Description: [mindspore][bugfix] fix the bug of mindspore version issue：https://gitcode.com/Ascend/MindSpeed-LLM/issues/1130 ![image.png](https://raw.gitcode.com/user-images/assets/7623105/f0edf3e4-62c6-4aa4-835a-1d299cd0493c/image.png 'image.png') See merge request: Ascend/MindSpeed-LLM!4244	3 个月前
pytorch	[pytorch][doc]update TND doc Co-authored-by: guozhihua2<guozhihua2@huawei.com> # message auto-generated for no-merge-commit merge: !4187 merge update-TND-doc-2.2.0 into 2.2.0 [pytorch][doc]update TND doc Created-by: guozhihua2 Commit-by: guozhihua2 Merged-by: ascend-robot Description: 1. 更新微调文档 See merge request: Ascend/MindSpeed-LLM!4187	3 个月前
quick_start.md	[pytorch][doc]quick_start.md & multi_sample_pack_finetune.md update Co-authored-by: JZY_SC30<jiangzhiyou@huawei.com> # message auto-generated for no-merge-commit merge: !4065 merge 2.2.0_doc_update into 2.2.0 [pytorch][doc]quick_start.md & multi_sample_pack_finetune.md update Created-by: JZY_SC30 Commit-by: JZY_SC30 Merged-by: ascend-robot Description: 2.2.0 doc update based on issue feedback from the community： 1.multi_sample_pack_finetune.md ：add explaination of packed attention mask 2.quick_start.md: add modelscope download link, simplify the instruction See merge request: Ascend/MindSpeed-LLM!4065	4 个月前
readme.md	[pytorch][doc] gitcode url & readme update Co-authored-by: peihan_liu666<liupeihan@huawei.com>	7 个月前

MindSpeed-LLM Docs 目录

内容	链接	备注
环境安装指导	install_guide.md
快速入门	quick_start.md	基于pytorch/mindspore后端的入门指导，从环境安装到预训练拉起
仓库支持模型清单	dense_models	稠密模型清单
	MOE_models	MOE模型清单
	SSM_models	SSM模型清单
特性清单	features	收集了部分仓库支持的性能优化和显存优化的特性
模型前处理操作	checkpoint_convert	支持mcore、hf、lora等各种不同格式权重间的部分转换路径
	checkpoint_convert_v2	支持大参数模型mcore、hf等各种不同格式权重间的转换
	pretrain_dataset	支持的数据集格式包括alpaca,pairwise,sharegpt
	finetune_dataset(待补充)
	reward_dataset
预训练方案	pretrain	多样本预训练方法
预训练方案	pretrain_eod	多样本pack预训练方法
模型微调方法	instruction_finetune	模型全参微调方案
	multi_sample_pack_finetune	多样本Pack微调方案
	multi-turn_conversation	多轮对话微调方案
	lora_finetune	模型lora微调方案
	qlora_finetune	模型qlora微调方案
模型推理方法	inference	模型推理
	chat	对话
	yarn	使用yarn方案来扩展上下文长度，支持长序列推理
模型评估	evaluation	模型评估方案
	evaluation_baseline	仓库模型评估清单
	evaluation_datasets	仓库支持评估数据集

【线性度性能展示】

基于 GPT3-175B 稠密大模型，从128颗 NPU 扩展到 7968颗 NPU 进行 MFU 与线性度实验，下图是实验数据：

图中呈现了对应集群规模下的 MFU 值与集群整体的 线性度情况. 计算公式已经放到社区，点击链接可进行参考：MFU计算公式，线性度计算公式.