Star179
222
代码介绍
代码
Issues49
Pull Requests55
流水线
Actions
讨论
Wiki
项目成员54
分析
项目设置
Star179
222
  1. MindSpeed-LLM
  2. /
  3. tests
  4. /
  5. tools
  6. /
  7. fsdp2
ascend-robotascend-robotchore(fsdp2): develop longcat-flash-lite model in fsdp2
518f5f29创建于 9 天前历史提交
文件最后提交记录最后更新时间
longcat_flash_lite_moe_hf_weight_convert.py
chore(fsdp2): develop longcat-flash-lite model in fsdp29 天前
longcat_flash_lite_moe_hf_weight_convert.sh
chore(fsdp2): develop longcat-flash-lite model in fsdp29 天前
moe_hf_param_merge_experts.py
[pytorch][bugfix]delete configuration.json merge3 个月前
moe_hf_param_merge_experts.sh
[pytorch][feature]integrate moe merge process into FSDP2 training3 个月前