MindSpeed-MM Agent Configuration

This directory contains shared guidance for AI coding agents working on MindSpeed-MM.

The .agents directory is the single source for reusable agent-facing context. Tool-specific directories such as .codex/, .claude/, .cursor/, or .trae/ can be generated locally from this shared source when needed.

MindSpeed-MM follows the Agent Skills convention for skill layout.

Directory Layout

Path	Purpose
`skills/`	Skill index and implementation conventions.
`knowledge/`	Shared knowledge context for agents.
`setup_agent.sh`	Optional helper for linking `.agents` into local tool-specific directories.

Usage

Link this shared configuration into a local agent directory:

bash .agents/setup_agent.sh codex
bash .agents/setup_agent.sh claude
bash .agents/setup_agent.sh cursor
bash .agents/setup_agent.sh trae

The script also accepts a custom agent name and creates .<agent-name>/ as a local adapter directory. Generated adapter directories are added to .git/info/exclude.

Architecture Summary

MindSpeed-MM supports two main training backend paths. Agents should identify the active backend before changing model code, data code, checkpoint conversion, examples, or tests.

Backend	Primary Entries	Description
MindSpeed Core / Megatron	`mindspeed_mm/training.py`, `mindspeed_mm/pretrain_.py`, `examples//*.sh`	Megatron-style flow using model/data/forward providers and hybrid parallelism.
FSDP2	`mindspeed_mm/fsdp/train/trainer.py`, `mindspeed_mm/config/config_manager.py`, `mindspeed_mm/fsdp/utils/register.py`	YAML-driven flow using plugin registration, `ModelHub`, FSDP2 data builders, and parallel plans.

See knowledge/architecture.md for the agent-facing architecture overview.

Skill Plan

Skill	Domain	Status	Priority	Description
mindspeed-mm-fsdp2-model-only-vlm-migration	Integration	Planned	P0	指导新模型接入 FSDP2 后端，覆盖参考样例、注册、配置、数据字段和端到端验收，当前阶段仅支持vlm迁移。
performance-analysis-report	Optimization	Planned	P0	将 profiling 结果和训练日志整理为瓶颈分析报告与优化建议。
fsdp2-dataset-migration	Integration	Planned	P0	指导新数据集接入 FSDP2 数据链路，覆盖 dataset type、collator 和 batch key。
flops-mfu-analysis	Optimization	Planned	P0	基于模型配置、输入形状和运行指标估算 FLOPs 与 MFU。
fused-operator-optimization	Optimization	Planned	P0	规划 RMSNorm、EP-BMM、ROPE 等融合算子替换及精度性能验证。
npu-environment-setup	Integration	Planned	P1	梳理指定模型在 Ascend/NPU 环境下的依赖、环境变量、安装顺序和最小验证方式。
best-configuration-recommendation	Optimization	Planned	P1	结合模型规模和并行策略，推荐可解释的训练配置组合（EP、TP、CP、FSDP）。
transformers-alignment-gate	Verification	Planned	P1	为 Transformers 版本升级提供对齐检查。
checkpoint-conversion-routing	Integration	Planned	P1	根据源格式、目标格式和模型类型选择合适的权重转换路径并检查关键参数。
minimal-doc-sync	Collaboration	Planned	P2	根据代码变更识别 README、特性文档或 example 文档中的最小同步范围。
pr-description-generation	Collaboration	Planned	P2	根据 diff、测试结果、风险和用户影响生成 PR 描述与评审申请内容。
unit-test-authoring	Verification	Planned	P2	辅助编写符合仓库风格的单元测试

See skills/README.md for the full skill index.