MindSpeed-LLM/examples/mcore/gemma2 · Ascend/MindSpeed-LLM - AtomGit

33fc8b89创建于 2025年10月30日历史提交

文件	最后提交记录	最后更新时间
chat_gemma2_9b_ptd.sh	!1745 新增baichuan2全参微调脚本和相应模版	1 年前
ckpt_convert_gemma2_hf2mcore.sh	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS	11 个月前
ckpt_convert_gemma2_mcore2hf.sh	!2828 [pytorch][sh]Add CUDA_DEVICE_MAX_CONNECTIONS	11 个月前
data_convert_gemma2_instruction.sh	!2673 docs readme modify	1 年前
data_convert_gemma2_pretrain.sh	!1608 新增gemma系列模型ST及UT	1 年前
evaluate_gemma2_27b_ptd.sh	!1530 refactor: support grok and gemma specification	1 年前
evaluate_gemma2_9b_ptd.sh	!1530 refactor: support grok and gemma specification	1 年前
generate_gemma2_27b_ptd.sh	!1530 refactor: support grok and gemma specification	1 年前
generate_gemma2_9b_ptd.sh	!1530 refactor: support grok and gemma specification	1 年前
pretrain_gemma2_27b_ptd.sh	!2336 更新examples下的脚本	1 年前
pretrain_gemma2_9b_ptd.sh	!2336 更新examples下的脚本	1 年前
tune_gemma2_9b_full_ptd.sh	[pytorch][feature]Decouple the 'variable-seq-lengths' and 'no-pad-to-seq-lengths' parameters	7 个月前