Latest commit (ascend-robot): Fix the crash caused by configuring target_filed in ais_bench

File	Last commit	Last updated
device_config	[Sync][Non-development code] Sync code from develop to master (Ascend/msmodeling!330)	16 days ago
microbench	[Sync][Non-development code] Sync code from develop to master (Ascend/msmodeling!330)	16 days ago
model-adaptation	[Sync][Non-development code] Sync code from develop to master (Ascend/msmodeling!330)	16 days ago
msmodeling-env-installer	[Sync][Non-development code] Sync code from develop to master (Ascend/msmodeling!330)	16 days ago
op-mapping	fix(security): add model source safety checks (Ascend/msmodeling!385)	4 days ago
optix-config	[FEAT] Release the unified msmodeling wheel package and CLI entry point (Ascend/msmodeling!370)	10 days ago
optix-deploy	[FEAT] Release the unified msmodeling wheel package and CLI entry point (Ascend/msmodeling!370)	10 days ago
optix-param-recommend	Fix the crash caused by configuring target_filed in ais_bench (Ascend/msmodeling!452)	1 day ago
text-generate-executor	fix(security): add model source safety checks (Ascend/msmodeling!385)	4 days ago
throughput-optimizer-executor	fix(security): add model source safety checks (Ascend/msmodeling!385)	4 days ago
throughput-optimizer-explainer	feat(skills): add throughput-optimizer-explainer (Ascend/msmodeling!413)	3 days ago
README.md	feat(skills): add throughput-optimizer-explainer (Ascend/msmodeling!413)	3 days ago

msmodeling skills

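This directory holds the Claude Code skills specific to the msmodeling project. They capture common performance-modeling, device-modeling, and profiling-support tasks as reusable execution workflows.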

Usage note: to enable these skills in Claude Code, copy this entire directory, .agents/skills, to .claude/skills.
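The copy step can be scripted. The following is a minimal sketch, assuming it is run against the repository root; the function name `enable_skills` is hypothetical and not part of the project.

```python
import shutil
from pathlib import Path


def enable_skills(repo_root: Path) -> Path:
    """Copy .agents/skills to .claude/skills under repo_root and return the target."""
    source = repo_root / ".agents" / "skills"
    target = repo_root / ".claude" / "skills"
    if not source.is_dir():
        raise FileNotFoundError(f"skills directory not found: {source}")
    # dirs_exist_ok=True lets the copy refresh an existing .claude/skills tree
    # instead of failing when the target directory is already present.
    shutil.copytree(source, target, dirs_exist_ok=True)
    return target
```

Calling `enable_skills(Path.cwd())` from the repository root would perform the copy. Files that exist only in the target are left in place, so a stale skill has to be removed by hand.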

Required reading for AI agents: read AGENTS.md in the project root first to understand the project conventions and the skill system.

msmodeling skills
msmodeling-env-installer
model-adaptation
device_config
op_mapping
microbench
text-generate-executor
throughput-optimizer-executor
throughput-optimizer-explainer

msmodeling-env-installer

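msmodeling environment installer: turns requests that explicitly target msmodeling, such as "install the msmodeling environment dependencies", "create myenv", "install this repository's requirements.txt", or "configure PYTHONPATH / HF_ENDPOINT", into an executable, verifiable, and traceable installation workflow. When the user only says "install the environment" or "install dependencies", the agent must first confirm whether the dependencies of the current msmodeling repository are meant.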

What it does

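Guides the AI agent through development-environment initialization, following the workflow defined in the RFC:

Repository root check: confirm that the current directory contains README.md and requirements.txt.
Python and uv check: require Python 3.10+; if uv is missing, install it from a mirror and resolve the real executable path.
Install path selection: by default, create a new myenv with uv; before falling back to an existing environment, check for torch_npu, torch-npu, and cudatoolkit.
Dependency installation and verification: after installing requirements.txt, run uv pip check --python <venv-python> or python -m pip check.
Environment variable configuration: set PYTHONPATH and HF_ENDPOINT=https://hf-mirror.com for the current session as needed.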
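The first two checks in the list above can be sketched as a small preflight function. This is an illustration under assumptions, not the logic of the shipped install scripts: the name `preflight` and its injectable `which` parameter are hypothetical.

```python
import shutil
import sys
from pathlib import Path

REQUIRED_FILES = ("README.md", "requirements.txt")
MIN_PYTHON = (3, 10)


def preflight(repo_root, python_version=sys.version_info[:2], which=shutil.which):
    """Return a list of problems; an empty list means the checks passed."""
    problems = []
    # Repository root check: both files must exist in the given directory.
    for name in REQUIRED_FILES:
        if not (Path(repo_root) / name).is_file():
            problems.append(f"missing {name}")
    # Python check: compare (major, minor) against the 3.10 minimum.
    if tuple(python_version) < MIN_PYTHON:
        problems.append("Python 3.10+ is required")
    # uv check: only reports absence; installing uv is left to the caller.
    if which("uv") is None:
        problems.append("uv not found on PATH")
    return problems
```

Returning a list instead of raising on the first failure lets an agent report every problem in one message before asking the user how to proceed.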

File layout

File	Purpose
`msmodeling-env-installer/SKILL.md`	Skill definition, trigger scenarios, installation workflow, and safety rules
`msmodeling-env-installer/scripts/install-current-project-deps.ps1`	Automated installation script for Windows PowerShell
`msmodeling-env-installer/scripts/install-current-project-deps.sh`	Automated installation script for Linux/macOS/WSL/Git Bash

Quick start

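State an explicit request in the conversation, for example "please install the msmodeling environment dependencies" or "set up the msmodeling environment according to the README". If the request is only "install the environment", the agent must first confirm whether the dependencies of the current msmodeling repository are meant.

On Windows PowerShell, run the script directly from the repository root: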

.\.agents\skills\msmodeling-env-installer\scripts\install-current-project-deps.ps1

On Linux/macOS/WSL/Git Bash, run the script directly from the repository root:

bash ./.agents/skills/msmodeling-env-installer/scripts/install-current-project-deps.sh

Key constraints

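Does not modify requirements.txt, the README, or project source code.
Does not overwrite an existing myenv by default, and does not persist system-level environment variables by default.
Network installs require user confirmation and tool permission grants.
scripts/install-current-project-deps.ps1 currently works only on Windows PowerShell; on Linux/macOS, use the generic commands in the README.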

model-adaptation

Skill for onboarding new models into TensorCast: starting from a simulation command and MindStudio Insight raw profiling, it guides the agent to run model_adapter doctor, review the ModelProfile, handle patch/bug AI tasks, export evidence.yaml, and run verify.

What it does

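Breaks new-model onboarding into deterministic tools and human checkpoints:

Collect the two required inputs: the simulation command and the matching raw profiling.
Run doctor and review the candidate profile, evidence draft, human questions, and ai tasks.
For fields that need human confirmation, generate precise questions and write the confirmed answers to hints.yaml or evidence.yaml.
For patch/bug scenarios, use ai_tasks[].prompt_text to have the user or the user's AI assistant generate code, and require human review.
Export evidence.yaml with export-evidence, then run verify.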

File layout

File	Purpose
`model-adaptation/SKILL.md`	Core workflow, human checkpoints, and verification requirements for onboarding a new model

Quick start

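Use this skill when the user says "onboard a new model", "generate a ModelProfile", "continue the adaptation based on the doctor report", "handle the patch AI task", or "export evidence from the doctor report".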

Key constraints

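Do not guess profile fields from the model name.
doctor does not generate model-specific patch code; it only generates AI tasks and prompts.
evidence.yaml is exported from doctor_after_profile.json.evidence_draft and then reviewed by a human.
Do not commit raw profiling, local walkthroughs, private paths, or temporary materials.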

device_config

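Natural-language device profile importer: guides the user through a progressive dialogue to convert a natural-language hardware description into a TensorCast DeviceProfile.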

What it does

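Guides the AI agent through a progressive dialogue workflow:

Collect information progressively: the first round asks only for the hardware name, the information source, and the preferred granularity; each round asks at most 2-3 questions.
Maintain an internal fact table: confirmed / ambiguous / missing / needs calibration.
Generate a runnable profile: write all user-confirmed values, provisional estimates, and fallback defaults into tensor_cast/device.py.
Verify and output the CLI command: run the import check and output the executable --device <PROFILE_NAME> command.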
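One way to picture the internal fact table is a list of records tagged with one of the four statuses. The sketch below is only an illustration: the `Fact` class, both helper functions, and the field names in the usage note are hypothetical, and the skill may track facts differently.

```python
from dataclasses import dataclass
from enum import Enum


class FactStatus(Enum):
    CONFIRMED = "confirmed"
    AMBIGUOUS = "ambiguous"
    MISSING = "missing"
    NEEDS_CALIBRATION = "needs calibration"


@dataclass
class Fact:
    name: str
    value: object
    status: FactStatus


def next_questions(facts, limit=3):
    """Pick at most `limit` unresolved fields to ask about in the next round."""
    unresolved = (FactStatus.MISSING, FactStatus.AMBIGUOUS)
    return [f.name for f in facts if f.status in unresolved][:limit]


def needs_calibration(facts):
    """List fields whose values are estimates or fallback defaults."""
    return [f.name for f in facts if f.status is FactStatus.NEEDS_CALIBRATION]
```

For example, a table holding a confirmed `hbm_bandwidth_gbps`, an estimated `l2_cache_mb`, and a missing `interconnect_topology` would yield one question for the next round and one entry to surface to the user as needing calibration.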

File layout

File	Purpose
`device_config/SKILL.md`	Skill definition, constraints, and execution workflow

Quick start

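State the request directly in a Claude Code conversation, for example "I want to import a new device topology", then follow the agent's progressive questions and provide the hardware specifications step by step.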

Key constraints

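DeviceProfile.__post_init__ registers the profile automatically, so name must be unique.
Writes to tensor_cast/device.py by default; writes to tensor_cast/device_profiles/ only when the user explicitly asks for it.
All defaults, estimates, and assumptions must be visible to the user and listed under needs calibration.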

op_mapping

op_mapping.yaml generator: maps TensorCast simulated operators to NPU profiling kernel types.

What it does

Uses a team of parallel sub-agents (one agent per operator) to trace the full vLLM→CANN call chain and generate op_mapping.yaml.

File layout

File	Purpose
`op-mapping/SKILL.md`	Core execution flow and six-stage workflow
`op-mapping/op-mapping-template.yaml`	YAML template snippet
`op-mapping/single-op-worker-prompt.md`	Instructions for the single-operator worker agent
`op-mapping/verifier-prompt.md`	Instructions for the verification stage
`op-mapping/ref/shape_matching_catalog.md`	10 kinds of differences between TC tensor shapes and NPU profiling shapes
`op-mapping/ref/tc_input_count_rules.md`	Rules for using `tc_input_count` safely
`op-mapping/ref/zero_cost_classification.md`	Classification rules for zero-cost operators

Quick start

Once all inputs (model, device, profiling CSV, repo versions) are collected, the agent runs the six-stage workflow automatically: GATHER → FORWARD MAPPING → REVERSE MAPPING → VERIFY → WRITE → COMMIT.
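The stage order can be pictured as a simple sequential runner. This is only an illustration: the handler callables and the stop-on-first-failure behavior are assumptions, not something SKILL.md is known to specify.

```python
STAGES = (
    "GATHER",
    "FORWARD MAPPING",
    "REVERSE MAPPING",
    "VERIFY",
    "WRITE",
    "COMMIT",
)


def run_stages(handlers):
    """Run stage handlers in order and return the names of completed stages.

    `handlers` maps each stage name to a zero-argument callable that returns
    True on success (hypothetical interface). Execution stops at the first
    stage that reports failure, so later stages never see unverified output.
    """
    completed = []
    for stage in STAGES:
        if not handlers[stage]():
            break
        completed.append(stage)
    return completed
```

Under this assumption, a failed VERIFY leaves WRITE and COMMIT unexecuted, which keeps an unverified mapping out of op_mapping.yaml.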

Key constraints

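kernel_type must match the CSV file name exactly (without the .csv suffix).
Three mapping paths: aten→op-plugin→aclnn, torch_npu.npu_*→op-plugin→aclnn, and vllm-ascend custom/Triton.
alternate_kernel_types must be at the same abstraction level; using a large fused op as an alternate for a sub-op is forbidden.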
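The first constraint is easy to check mechanically. A minimal sketch follows, assuming a case-sensitive comparison; the function name and the kernel name in the usage note are hypothetical.

```python
from pathlib import Path


def kernel_type_matches(kernel_type: str, csv_path: str) -> bool:
    """Return True when kernel_type equals the CSV file name without .csv."""
    path = Path(csv_path)
    # Path.stem drops only the final suffix, so "Foo.csv" gives "Foo".
    return path.suffix == ".csv" and path.stem == kernel_type
```

For instance, `kernel_type_matches("MatMulV2", "profiling/MatMulV2.csv")` holds, while passing `"MatMulV2.csv"` or `"matmulv2"` as the kernel type does not.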

microbench

Microbench run script generator: generates a <KernelType>_run.py from profiling CSVs that can be replayed on an NPU.

What it does

Generates a runnable tools/perf_data_collection/op_replay/<KernelType>_run.py for a profiling kernel CSV, used for measured replay on an NPU.

File layout

File	Purpose
`microbench/SKILL.md`	Skill definition and repo search order

Quick start

After the user provides the kernel_type, device profile, vllm_ascend version, and CSV path, the agent generates a replayable run script.

Key constraints

Prefer locally cloned repos, searched at the specified paths.
When a repo is missing, fetch it with the clone command provided in SKILL.md.
The generated run script is invoked by run_all_op.py / profile_and_update_db.py.

text-generate-executor

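Single-point verification executor for text_generate. It turns a user's verification request about python -m cli.inference.text_generate into a CLI command that can be confirmed and executed, then runs it after confirmation and summarizes the results.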

What it does

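Generates single-point simulation commands for scenarios with a known model, hardware, batch/query length, prefill or decode mode, fixed TP/DP/EP/MOE strategy, profiling database, trace/debug, or re-verification of the throughput optimizer's best row.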

File layout

File	Purpose
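`text-generate-executor/SKILL.md`	Main skill description, default policies, validation rules, and handoff rules
`text-generate-executor/references/dialog-flow.md`	Progressive parameter-gathering dialogue flow
`text-generate-executor/references/text-generate-params.md`	`text_generate` parameter quick reference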

Quick start

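For requests such as "run a text_generate validation for me", "convert the throughput_optimizer best row to text_generate and run it", or "export a chrome trace", the agent fills in missing parameters, shows the command and its assumptions, and executes after the user confirms.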

Key constraints

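Before execution, the full command and key assumptions must be shown and explicit confirmation requested.
Decode mode must confirm --context-length; profiling mode must provide --profiling-database.
text_generate validates only a fixed candidate; it does not perform TP/EP/MOE-DP search.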
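
The two flag requirements above can be sketched as a check run on the assembled argument list before the command is shown to the user. This is an illustrative sketch, not the skill's own logic: check_text_generate_args is a hypothetical helper, and because this README does not say how decode or profiling mode is selected on the command line, both are passed in as plain booleans.

```python
from typing import List, Sequence


def has_flag(argv: Sequence[str], flag: str) -> bool:
    """True if `flag` appears as `--flag value` or `--flag=value`."""
    return any(a == flag or a.startswith(flag + "=") for a in argv)


def check_text_generate_args(
    argv: Sequence[str], decode: bool, profiling: bool
) -> List[str]:
    """Return human-readable problems; an empty list means none were found."""
    problems: List[str] = []
    if decode and not has_flag(argv, "--context-length"):
        problems.append("decode mode needs --context-length to be confirmed")
    if profiling and not has_flag(argv, "--profiling-database"):
        problems.append("profiling mode needs --profiling-database")
    return problems
```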

throughput-optimizer-executor

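Deployment planning executor for throughput_optimizer. It turns natural-language requests about throughput planning, hardware comparison, parallelism search, and PD aggregation/disaggregation/ratio optimization into python -m cli.inference.throughput_optimizer commands.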

What it does

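For search and planning scenarios, it collects the model, hardware, device count, input/output lengths, SLO, deployment mode, and search space, generates the optimizer command, runs it after confirmation, and summarizes the best parallel strategy, batch, concurrency, throughput, TTFT, TPOT, and PD ratio.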

File layout

File	Purpose
`throughput-optimizer-executor/SKILL.md`	Main skill description, default policies, validation rules, and handoff rules
`throughput-optimizer-executor/references/dialog-flow.md`	Deployment mode identification and progressive parameter-elicitation flow
`throughput-optimizer-executor/references/throughput-optimizer-params.md`	Quick reference for `throughput_optimizer` parameters
`throughput-optimizer-executor/scripts/extract_throughput_optimizer_result.py`	Script that produces a structured summary of optimizer stdout

Quick start

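For requests such as "compare two kinds of hardware", "search for the best TP for Qwen 32B", "evaluate PD disaggregation capability", or "compute the P/D instance ratio", the agent identifies the aggregation, disagg, or PD ratio mode, fills in the SLO and search space, and executes after confirmation.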

Key constraints

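--enable-optimize-prefill-decode-ratio cannot be used together with --disagg.
A multi-hardware comparison shares a single --num-devices value; this must be stated before execution.
Before execution, explicitly confirm whether prefix cache and MTP are enabled; when they are, also supply the hit rate assumption for prefix cache and the token count and acceptance rate assumptions for MTP.
This skill performs candidate search and planning; single-point re-validation should be handed off to text-generate-executor.
Explanations of result plausibility, hardware differences, Cube/Vec/Comm/Mem, and best row mapping should be handed off to throughput-optimizer-explainer.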
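
The mutual-exclusion rule for the two flags can be checked on the argument list before anything is run. The sketch below is illustrative only: check_optimizer_flags is a hypothetical helper, and the README does not say whether the CLI itself rejects the combination.

```python
from typing import Sequence, Set

# The two flags this README says must not be combined.
MUTUALLY_EXCLUSIVE = frozenset(
    {"--enable-optimize-prefill-decode-ratio", "--disagg"}
)


def flag_names(argv: Sequence[str]) -> Set[str]:
    """Collect long option names, stripping any `=value` part."""
    return {a.split("=", 1)[0] for a in argv if a.startswith("--")}


def check_optimizer_flags(argv: Sequence[str]) -> None:
    """Raise ValueError if both mutually exclusive flags are present."""
    if MUTUALLY_EXCLUSIVE <= flag_names(argv):
        raise ValueError(
            "--enable-optimize-prefill-decode-ratio cannot be combined "
            "with --disagg"
        )
```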

throughput-optimizer-explainer

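Result explainer for throughput_optimizer. It analyzes whether optimizer output is plausible, compares differences between hardware or parallel strategies, explains Cube/Vec/Comm/Mem bottlenecks in the Prefill and Decode phases, and maps the best row to text_generate validation commands.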

What it does

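Establishes evidence levels and explanation boundaries around optimizer results:

Identify the aggregation, disaggregation, or PD ratio mode.
Extract the comparable conditions: model, hardware, input/output lengths, SLO, quantization, compile, prefix cache, MTP, and search space.
Extract the best row, top candidates, throughput, TTFT, TPOT, concurrency, batch, parallel strategy, PD ratio, QPS, and breakdown.
Determine the evidence level as one of macro_only, optimizer_phase_breakdown, text_generate_phase_breakdown, text_generate_op_bound, or profiler_trace.
Aggregation results must be split into Prefill forward, Decode forward, and the scheduling formula; they must not be treated as a single forward pass.
When operator-level attribution is needed, use text_generate --dump-op-bound-results, and state clearly that it is TensorCast simulated attribution rather than real profiler evidence.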
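
The five evidence levels named above can be modeled as an ordered enumeration. This sketch rests on two assumptions: that the levels are listed from weakest to strongest, and that operator-level statements require at least text_generate_op_bound. The authoritative rules live in references/evidence-levels.md, and the class and function names here are hypothetical.

```python
from enum import IntEnum


class EvidenceLevel(IntEnum):
    """Evidence levels, assumed to be ordered weakest to strongest."""

    MACRO_ONLY = 0
    OPTIMIZER_PHASE_BREAKDOWN = 1
    TEXT_GENERATE_PHASE_BREAKDOWN = 2
    TEXT_GENERATE_OP_BOUND = 3
    PROFILER_TRACE = 4


def supports_operator_attribution(level: EvidenceLevel) -> bool:
    """Operator-level statements need op-bound data or a profiler trace."""
    return level >= EvidenceLevel.TEXT_GENERATE_OP_BOUND


def is_measured_evidence(level: EvidenceLevel) -> bool:
    """Only a profiler trace is measured; op-bound data is simulated."""
    return level == EvidenceLevel.PROFILER_TRACE
```

Keeping the two predicates separate mirrors the distinction this README draws: op-bound results allow operator attribution, but only as simulated attribution rather than profiler evidence.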

File layout

File	Purpose
`throughput-optimizer-explainer/SKILL.md`	Main skill description, evidence rules, workflow, mapping rules, and output requirements
`throughput-optimizer-explainer/references/aggregation-mapping.md`	Mapping from the aggregation best row to Prefill/Decode validation commands
`throughput-optimizer-explainer/references/disaggregation-mapping.md`	Mapping from disaggregation and PD ratio results to `text_generate`
`throughput-optimizer-explainer/references/evidence-levels.md`	Evidence levels, the conclusions each supports, and rules against over-inference
`throughput-optimizer-explainer/references/bottleneck-rules.md`	Rules for explaining Cube/Vec/Comm/Mem and parallel strategies
`throughput-optimizer-explainer/references/output-template.md`	Concise output template
`throughput-optimizer-explainer/scripts/parse_optimizer_output.py`	Parses optimizer output, dump tables, `text_generate` breakdowns, and op-bound tables into JSON
`throughput-optimizer-explainer/scripts/build_text_generate_commands.py`	Generates Prefill/Decode validation commands from the normalized best row JSON
`throughput-optimizer-explainer/scripts/compare_phase_breakdowns.py`	Compares differences between Cube/Vec/Comm/Mem breakdowns or op-bound tables

Quick start

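Use this skill when the user asks "is this throughput_optimizer result plausible", "why is A3 faster/slower than A2", "which of Cube/Vec/Comm/Mem is the bottleneck", or "convert the best row into text_generate validation commands".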

Key constraints

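With macro-level output only, inferences are limited to the deployment, phase, and strategy level; no specific operator or real kernel bottleneck may be asserted.
text_generate --dump-op-bound-results is TensorCast simulated operator attribution and must be distinguished from real profiler/kernel evidence.
Aggregation throughput is not single-forward TPS; explanation and re-validation must split it into two validation commands, one for Prefill and one for Decode.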
