File | Last commit | Last updated
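feat: migrate the AICPU operator info library from the JSON merge mode to the OpDef form

Co-authored-by: zhaowenrui666 <zhaowenrui7@huawei.com>

# message auto-generated for no-merge-commit merge: !4864 merge opdef into master

Created-by: zhaowenrui666
Commit-by: zhaowenrui666
Merged-by: cann-robot

Description:

## Description

The AICPU operator info for `AttentionWorkerScheduler` and `FfnWorkerScheduler` is still maintained as hand-written JSON and is not connected to the OpDef-driven generation chain. As a result, the JSON is separated from the operator definition, there is no single source of truth, and the build chain is costly to maintain.

This PR migrates the AICPU operator info library of these two operators from hand-written JSON to the OpDef generation chain. The main changes are:

1. Add a corresponding `*_aicpu_def.cpp` for `AttentionWorkerScheduler` and `FfnWorkerScheduler`, using a real OpDef to describe inputs, outputs, attributes, and the AICPU configuration.
2. Delete the original hand-written JSON source-of-truth files; the AICPU JSON of these two operators is no longer maintained directly.
3. Adjust CMake and the AICPU generation chain so that `cust_aicpu_kernel.json` is generated through `*_aicpu_def.cpp -> libaicpu_ops.so -> op_build --aicpu -> aicpu_kernel.ini -> aicpu_parser_ini_to_json.py`.
4. Add an AICPU-specific `aicpu_parser_ini_to_json.py` to the repository, which independently handles validation and generation for the AICPU ini-to-json conversion.
5. Unify how the default AICPU configuration of `AttentionWorkerScheduler` and `FfnWorkerScheduler` is expressed, and complete the attribute definitions of `FfnWorkerScheduler`.

## Related Issue

- Related Issue #2196
- https://gitcode.com/cann/ops-transformer/issues/2196

## Testing

Verification was done against the rebuilt artifacts, with the following results:

1. opInfo vs original JSON: passed
2. I/O normalized vs canndev: passed
3. Full JSON vs INI comparison: passed
4. `aicpu_parser_ini_to_json.py` validation: passed
5. JSON comparison before and after the change: passed

Key results:

- Regenerating the target prints `Compile op info cfg successfully.`
- Every field path present before the change also exists after it.
- The only new fields are `opInfo.formatAgnostic`, `opInfo.opsFlag`, and `opInfo.subTypeOfInferShape`.
- `input0`/`output0` of `AttentionWorkerScheduler` and `FfnWorkerScheduler` match the canndev baseline.

## Documentation update

None.

## Type labels

- [x] 🐛 Bug fix
- [ ] ✨ New feature
- [ ] ⚡ Performance optimization
- [ ] ♻️ Refactoring
- [ ] 🧪 Tests
- [ ] 📦 Build/CI
- [ ] 🔧 Configuration change
- [ ] 📝 Documentation update
- [ ] ⬆️ Dependency upgrade
- [ ] 🔒 Security fix
- [ ] 🧹 Code cleanup
- [ ] ❓ Other (please describe):

See merge request: cann/ops-transformer!4864

15 days ago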
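The last step of the chain described above (ini to json, plus the "every old field path still exists" check from the test list) can be illustrated with a minimal Python sketch. This is not the repository's `aicpu_parser_ini_to_json.py`: the function names `ini_to_json` and `field_paths` are hypothetical, and the sketch assumes that the ini file has one section per operator with dotted keys such as `opInfo.opsFlag`, which is inferred from the field paths quoted in the description.

```python
import configparser


def ini_to_json(ini_text: str) -> dict:
    """Convert ini text (one section per operator, dotted keys) to a nested dict.

    Assumed layout, for illustration only:
        [OpName]
        opInfo.engine=...
        input0.name=...
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep key case, e.g. "opInfo" stays "opInfo"
    parser.read_string(ini_text)

    result = {}
    for op_name in parser.sections():
        op_tree = {}
        for dotted_key, value in parser.items(op_name):
            *parents, leaf = dotted_key.split(".")
            node = op_tree
            for part in parents:
                node = node.setdefault(part, {})
                if not isinstance(node, dict):
                    # e.g. both "a=1" and "a.b=2" were given
                    raise ValueError(f"[{op_name}] key conflict at {dotted_key!r}")
            if isinstance(node.get(leaf), dict):
                raise ValueError(f"[{op_name}] key conflict at {dotted_key!r}")
            node[leaf] = value
        result[op_name] = op_tree
    return result


def field_paths(tree: dict, prefix: str = "") -> set:
    """Return the set of dotted paths to every leaf value in a nested dict."""
    paths = set()
    for key, value in tree.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            paths |= field_paths(value, path)
        else:
            paths.add(path)
    return paths
```

With two JSON trees `before` and `after`, the check "every old field path still exists" is `field_paths(before) <= field_paths(after)`, and the newly added fields are `field_paths(after) - field_paths(before)`.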
fix: aclrtMemcpy host to host for schedule_context; correct the wrong memory copy kind in the sample code

Co-authored-by: Developer user <jialimin1@huawei.com>

# message auto-generated for no-merge-commit merge: !5918 merge master into master

Created-by: jialimin1
Commit-by: Developer user
Merged-by: cann-robot

Description:

## Description

1. Use a host-to-host `aclrtMemcpy` for `schedule_context`, correcting the wrong memory copy kind in the sample code.
2. Update the log messages.

## Related Issue

Issue #2640
Issue #2641

## Testing

```
Build package verification:
[2026-05-26 03:18:07] ./uninstall.sh
[2026-05-26 03:18:07] ./upgrade.sh
[2026-05-26 03:18:07] CRC: 1888598674
[2026-05-26 03:18:07] SHA256: e5e52d301ebc0a74b5f34866bf194936a5fc441b6658e781ba1bc761fd8fbde1
[2026-05-26 03:18:07] Skipping md5sum at user request
[2026-05-26 03:18:07]
[2026-05-26 03:18:07] Self-extractable archive "cann-ops-transformer-custom_linux-x86_64.run" successfully created.

Sample verification:
[2026-05-26 03:21:42] Start compile and run example file: ../ffn/ffn_worker_scheduler/examples/test_aclnn_inplace_ffn_worker_scheduler.cpp
[2026-05-26 03:21:49] Init ffn success, token_info_buf_size=192,token_data_buf_size= 1024.
[2026-05-26 03:21:49] layer_ids_buf_size = 8.
[2026-05-26 03:21:49] session_ids_buf_size = 8.
[2026-05-26 03:21:49] micro_batch_ids_buf_size = 8.
[2026-05-26 03:21:49] expert_ids_buf_size = 80.
[2026-05-26 03:21:49] layer_ids[0] is: 55
[2026-05-26 03:21:49] layer_ids[1] is: 55
[2026-05-26 03:21:49] session_ids[0] is: 0
[2026-05-26 03:21:49] session_ids[1] is: 1
[2026-05-26 03:21:49] micro_batch_ids[0] is: 0
[2026-05-26 03:21:49] micro_batch_ids[1] is: 0
[2026-05-26 03:21:49] expert_ids[0] is: 0
[2026-05-26 03:21:49] expert_ids[1] is: 1
[2026-05-26 03:21:49] expert_ids[2] is: 2
[2026-05-26 03:21:49] expert_ids[3] is: 3
[2026-05-26 03:21:49] expert_ids[4] is: 4
[2026-05-26 03:21:49] expert_ids[5] is: 5
[2026-05-26 03:21:49] expert_ids[6] is: 6
[2026-05-26 03:21:49] expert_ids[7] is: 7
[2026-05-26 03:21:49] expert_ids[8] is: 8
[2026-05-26 03:21:49] expert_ids[9] is: 9
[2026-05-26 03:21:49] expert_ids[10] is: 0
[2026-05-26 03:21:49] expert_ids[11] is: 1
[2026-05-26 03:21:49] expert_ids[12] is: 2
[2026-05-26 03:21:49] expert_ids[13] is: 3
[2026-05-26 03:21:49] expert_ids[14] is: 4
[2026-05-26 03:21:49] expert_ids[15] is: 5
[2026-05-26 03:21:49] expert_ids[16] is: 6
[2026-05-26 03:21:49] expert_ids[17] is: 7
[2026-05-26 03:21:49] expert_ids[18] is: 8
[2026-05-26 03:21:49] expert_ids[19] is: 9
[2026-05-26 03:21:50] run test_aclnn_ffn_worker_scheduler, execute samples success
[2026-05-26 03:21:50] Example completed successfully
```

## Documentation update

Not applicable.

## Type labels

- [x] 🐛 Bug fix
- [ ] ✨ New feature
- [ ] ⚡ Performance optimization
- [ ] ♻️ Refactoring
- [ ] 🧪 Tests
- [ ] 📦 Build/CI
- [ ] 🔧 Configuration change
- [ ] 📝 Documentation update
- [ ] ⬆️ Dependency upgrade
- [ ] 🔒 Security fix
- [ ] 🧹 Code cleanup
- [ ] ❓ Other (please describe):

See merge request: cann/ops-transformer!5918

7 days ago
Add the AI CPU implementation of FfnWorkerScheduler and AttentionWorkerScheduler

Co-authored-by: @ding-jing12 <dingjing19@huawei.com>

# message auto-generated for no-merge-commit merge: !1076 merge atten_ffn_op into master

Created-by: Ding_Jing
Commit-by: @ding-jing12
Merged-by: cann-robot

Description:

## Description

Add the AI CPU implementation of FfnWorkerScheduler and AttentionWorkerScheduler.

## Related Issue

## Testing

```
Functional verification: AttentionWorkerScheduler
[2026-01-28 11:32:27]
[2026-01-28 11:32:27] Start to run example,name:attention_worker_scheduler mode:eager
[2026-01-28 11:32:27] Start compile and run example file: ../attention/attention_worker_scheduler/examples/test_aclnn_inplace_attention_worker_scheduler.cpp
[2026-01-28 11:32:27] pkg_mode:cust vendor_name:custom
[2026-01-28 11:32:33] micro_batch_id = 0.
[2026-01-28 11:32:33] run test_aclnn_attention_worker_scheduler, execute samples success
[2026-01-28 11:32:33] Example completed successfully

Functional verification: FfnWorkerScheduler
[2026-01-28 11:40:44]
[2026-01-28 11:40:44] Start to run example,name:ffn_worker_scheduler mode:eager
[2026-01-28 11:40:44] Start compile and run example file: ../ffn/ffn_worker_scheduler/examples/test_aclnn_inplace_ffn_worker_scheduler.cpp
[2026-01-28 11:40:44] pkg_mode:cust vendor_name:custom
[2026-01-28 11:40:50] Init ffn success, token_info_buf_size=192,token_data_buf_size= 1024.
[2026-01-28 11:40:50] layer_ids_buf_size = 8.
[2026-01-28 11:40:50] session_ids_buf_size = 8.
[2026-01-28 11:40:50] micro_batch_ids_buf_size = 8.
[2026-01-28 11:40:50] expert_ids_buf_size = 80.
[2026-01-28 11:40:50] layer_ids[0] is: 55
[2026-01-28 11:40:50] layer_ids[1] is: 55
[2026-01-28 11:40:50] session_ids[0] is: 0
[2026-01-28 11:40:50] session_ids[1] is: 1
[2026-01-28 11:40:50] micro_batch_ids[0] is: 0
[2026-01-28 11:40:50] micro_batch_ids[1] is: 0
[2026-01-28 11:40:50] expert_ids[0] is: 0
[2026-01-28 11:40:50] expert_ids[1] is: 1
[2026-01-28 11:40:50] expert_ids[2] is: 2
[2026-01-28 11:40:50] expert_ids[3] is: 3
[2026-01-28 11:40:50] expert_ids[4] is: 4
[2026-01-28 11:40:50] expert_ids[5] is: 5
[2026-01-28 11:40:50] expert_ids[6] is: 6
[2026-01-28 11:40:50] expert_ids[7] is: 7
[2026-01-28 11:40:50] expert_ids[8] is: 8
[2026-01-28 11:40:50] expert_ids[9] is: 9
[2026-01-28 11:40:50] expert_ids[10] is: 0
[2026-01-28 11:40:50] expert_ids[11] is: 1
[2026-01-28 11:40:50] expert_ids[12] is: 2
[2026-01-28 11:40:50] expert_ids[13] is: 3
[2026-01-28 11:40:50] expert_ids[14] is: 4
[2026-01-28 11:40:50] expert_ids[15] is: 5
[2026-01-28 11:40:50] expert_ids[16] is: 6
[2026-01-28 11:40:50] expert_ids[17] is: 7
[2026-01-28 11:40:50] expert_ids[18] is: 8
[2026-01-28 11:40:50] expert_ids[19] is: 9
[2026-01-28 11:40:50] run test_aclnn_ffn_worker_scheduler, execute samples success
[2026-01-28 11:40:50] Example completed successfully
```

## Documentation update

Update the README.md files in attention/attention_worker_scheduler/ and ffn/ffn_worker_scheduler/, adding usage instructions.

## Type labels

- [ ] Bug fix
- [x] New feature
- [ ] Performance optimization
- [ ] Documentation update
- [ ] Other (please describe):

See merge request: cann/ops-transformer!1076

4 months ago