文件最后提交记录最后更新时间
[feature]分布式支持context parallel Co-authored-by: TrHan<hantianrui1@huawei.com> # message auto-generated for no-merge-commit merge: !35927 merge pr_cp_2.7.1_0516 into v2.7.1 [feature]分布式支持context parallel Created-by: TrHan Commit-by: TrHan Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [x] 需求 - [ ] 问题单 - [ ] issue/工单 - [ ] 重构优化 - [ ] 资料更新 [#2029](https://gitcode.com/Ascend/pytorch/issues/2029) # 【修改方案】 > 请描述修改内容的具体实现,涉及哪些组件之间进行交互,可以用1、2、3、...进行罗列\ > 如果是需求或者重构类的PR,需要补充详细设计文档(说明上下游组件关系、时序图、类图、DFX能力等内容) 参考原生torch cp实现方案,注册shard seq dim的策略,通过monkey patch将sdpa+DTensor的dispatch换成cp的dispatch,从而触发ring attention流程 详细方案见issue # 【资料变更】 > 请确认是否涉及资料变更。如涉及,需要在PR中体现,并简要说明修改内容。如不涉及,需填写“不涉及” 涉及,docs/zh/native_apis/pytorch_2-7-1/torch-distributed-tensor.md # 【接口变更】 > 请确认是否涉及跨代码仓或者客户面可见的接口变更。如涉及,需要详细说明接口以及对应的变更内容,同时需要在资料中体现。如不涉及,需填写“不涉及” 涉及新增接口,docs/zh/native_apis/pytorch_2-7-1/torch-distributed-tensor.md # 【功能验证】 > 说明测试场景,测试方法。如果本次测试方式与常规单元测试不同,请详细说明您的测试步骤\ > 新增/变更内容是否已新增/适配UT测试用例看护,并补充测试自验证截图 围绕ring attention,对如下参数矩阵进行交叉验证 ![image.png](https://raw.gitcode.com/user-images/assets/7404318/1e24ea90-28fc-4d24-9b2f-71c54bcf8247/image.png 'image.png') 用例执行结果: ![image.png](https://raw.gitcode.com/user-images/assets/7404318/0916cd7a-20a3-4b33-9d83-bde13fb7b3fc/image.png 'image.png') # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [x] 代码注释完备,正确记录错误日志 - [x] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [x] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!3592712 天前
[feature]分布式支持context parallel Co-authored-by: TrHan<hantianrui1@huawei.com> # message auto-generated for no-merge-commit merge: !35927 merge pr_cp_2.7.1_0516 into v2.7.1 [feature]分布式支持context parallel Created-by: TrHan Commit-by: TrHan Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [x] 需求 - [ ] 问题单 - [ ] issue/工单 - [ ] 重构优化 - [ ] 资料更新 [#2029](https://gitcode.com/Ascend/pytorch/issues/2029) # 【修改方案】 > 请描述修改内容的具体实现,涉及哪些组件之间进行交互,可以用1、2、3、...进行罗列\ > 如果是需求或者重构类的PR,需要补充详细设计文档(说明上下游组件关系、时序图、类图、DFX能力等内容) 参考原生torch cp实现方案,注册shard seq dim的策略,通过monkey patch将sdpa+DTensor的dispatch换成cp的dispatch,从而触发ring attention流程 详细方案见issue # 【资料变更】 > 请确认是否涉及资料变更。如涉及,需要在PR中体现,并简要说明修改内容。如不涉及,需填写“不涉及” 涉及,docs/zh/native_apis/pytorch_2-7-1/torch-distributed-tensor.md # 【接口变更】 > 请确认是否涉及跨代码仓或者客户面可见的接口变更。如涉及,需要详细说明接口以及对应的变更内容,同时需要在资料中体现。如不涉及,需填写“不涉及” 涉及新增接口,docs/zh/native_apis/pytorch_2-7-1/torch-distributed-tensor.md # 【功能验证】 > 说明测试场景,测试方法。如果本次测试方式与常规单元测试不同,请详细说明您的测试步骤\ > 新增/变更内容是否已新增/适配UT测试用例看护,并补充测试自验证截图 围绕ring attention,对如下参数矩阵进行交叉验证 ![image.png](https://raw.gitcode.com/user-images/assets/7404318/1e24ea90-28fc-4d24-9b2f-71c54bcf8247/image.png 'image.png') 用例执行结果: ![image.png](https://raw.gitcode.com/user-images/assets/7404318/0916cd7a-20a3-4b33-9d83-bde13fb7b3fc/image.png 'image.png') # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [x] 代码注释完备,正确记录错误日志 - [x] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [x] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!3592712 天前
fix npu_fusion_attention_grad sharding strategy on v2.7.1 Co-authored-by: zhangguoguang<zhangguoguang2@huawei.com> # message auto-generated for no-merge-commit merge: !32723 merge fix_fa_dtensor_27 into v2.7.1 fix npu_fusion_attention_grad sharding strategy on v2.7.1 Created-by: zhangguoguang Commit-by: zhangguoguang Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [ ] 需求 - [x] 问题单 - [ ] issue/工单 - [ ] 重构优化 - [ ] 资料更新 # 【修改方案】 当pse为None时,反向计算pse的梯度也为None,不需要为其设置分片方式 # 【资料变更】 不涉及 # 【接口变更】 不涉及 # 【功能验证】 问题单测试脚本执行结果: ![image.png](https://raw.gitcode.com/user-images/assets/7404318/3fdb3aa8-c3a3-4839-a9e1-acb4ff33644e/image.png 'image.png') # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [x] 代码注释完备,正确记录错误日志 - [x] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [x] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!327232 个月前
fix DTensor Co-authored-by: jizewei<jizewei@huawei.com> # message auto-generated for no-merge-commit merge: !29015 merge v2.7.1_fix_dtensor_ut into v2.7.1 fix DTensor Created-by: jizewei Commit-by: jizewei Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature /kind bug **What does this PR do / why do we need it**: 1. change DTensor UT from 4 devices to 2 devices, because the CI environment only has 2 devices 2. adapt the newest npu_fusion_attention **Which issue(s) this PR fixes**: <!-- *Automatically closes linked issue when PR is merged. Usage: Fixes #<issue number>, or Fixes (paste link of issue). --> Fixes # **Special notes for your reviewers**: test_attention_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/72531b6a-9e7e-499a-a374-bddebfd3e964/image.png 'image.png') test_gather_swiglu.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/54508056-1e78-42a7-8ada-e735a823c69d/image.png 'image.png') test_math_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/05f20ae1-9408-48cc-a1e6-b6587077c933/image.png 'image.png') test_matrix_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/d2f60a7a-45bf-464a-8b9a-5bbf89552fdd/image.png 'image.png') test_moe_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/fd79349c-385f-47f0-b792-fadb22a364e8/image.png 'image.png') See merge request: Ascend/pytorch!290154 个月前
fix DTensor Co-authored-by: jizewei<jizewei@huawei.com> # message auto-generated for no-merge-commit merge: !29015 merge v2.7.1_fix_dtensor_ut into v2.7.1 fix DTensor Created-by: jizewei Commit-by: jizewei Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature /kind bug **What does this PR do / why do we need it**: 1. change DTensor UT from 4 devices to 2 devices, because the CI environment only has 2 devices 2. adapt the newest npu_fusion_attention **Which issue(s) this PR fixes**: <!-- *Automatically closes linked issue when PR is merged. Usage: Fixes #<issue number>, or Fixes (paste link of issue). --> Fixes # **Special notes for your reviewers**: test_attention_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/72531b6a-9e7e-499a-a374-bddebfd3e964/image.png 'image.png') test_gather_swiglu.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/54508056-1e78-42a7-8ada-e735a823c69d/image.png 'image.png') test_math_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/05f20ae1-9408-48cc-a1e6-b6587077c933/image.png 'image.png') test_matrix_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/d2f60a7a-45bf-464a-8b9a-5bbf89552fdd/image.png 'image.png') test_moe_ops.py ![image.png](https://raw.gitcode.com/user-images/assets/7404318/fd79349c-385f-47f0-b792-fadb22a364e8/image.png 'image.png') See merge request: Ascend/pytorch!290154 个月前
fixing review comments on v2.7.1 Co-authored-by: zhangguoguang<zhangguoguang2@huawei.com> # message auto-generated for no-merge-commit merge: !32766 merge fix_review_271 into v2.7.1 fixing review comments on v2.7.1 Created-by: zhangguoguang Commit-by: zhangguoguang Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [x] 需求 - [ ] 问题单 - [ ] issue/工单 - [ ] 重构优化 - [ ] 资料更新 # 【修改方案】 增加代码可读性和完整性 # 【资料变更】 不涉及 # 【接口变更】 不涉及 # 【功能验证】 ![image.png](https://raw.gitcode.com/user-images/assets/7404318/8fcf51a4-ef98-4bbe-a952-1c41fb9c97f2/image.png 'image.png') # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [x] 代码注释完备,正确记录错误日志 - [x] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [x] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!327661 个月前
refactor dtensor rules Co-authored-by: kisnwang<wangkaisheng2@huawei.com> # message auto-generated for no-merge-commit merge: !34589 merge v2.7.1 into v2.7.1 refactor dtensor rules Created-by: kisnwang Commit-by: kisnwang Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [ ] 需求 - [ ] 问题单 - [ ] issue/工单 - [x] 重构优化 [#1743](https://gitcode.com/Ascend/pytorch/issues/1743) - [ ] 资料更新 # 【修改方案】 自定义算子注册与社区对齐,删除utils/dtensor.py # 【资料变更】 不涉及 # 【接口变更】 不涉及 # 【功能验证】 适配UT测试用例看护 # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [ ] 代码注释完备,正确记录错误日志 - [ ] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [ ] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!3458924 天前
register strategy for npu custom ops Co-authored-by: jizewei<jizewei@huawei.com> # message auto-generated for no-merge-commit merge: !27074 merge v2.7.1_register_strategy into v2.7.1 register strategy for npu custom ops Created-by: jizewei Commit-by: jizewei Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature /kind feature **What does this PR do / why do we need it**: register sharding strategies for npu custom ops to support DTensor: 1. npu_add_rms_norm 2. npu_moe_token_permute, npu_moe_token_permute_grad 3. npu_moe_token_unpermute, npu_moe_token_unpermute_grad 4. npu_rotary_mul, npu_rotary_mul_backward **Which issue(s) this PR fixes**: <!-- *Automatically closes linked issue when PR is merged. Usage: Fixes #<issue number>, or Fixes (paste link of issue). --> Fixes # **Special notes for your reviewers**: See merge request: Ascend/pytorch!270746 个月前
refactor dtensor rules Co-authored-by: kisnwang<wangkaisheng2@huawei.com> # message auto-generated for no-merge-commit merge: !34589 merge v2.7.1 into v2.7.1 refactor dtensor rules Created-by: kisnwang Commit-by: kisnwang Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [ ] 需求 - [ ] 问题单 - [ ] issue/工单 - [x] 重构优化 [#1743](https://gitcode.com/Ascend/pytorch/issues/1743) - [ ] 资料更新 # 【修改方案】 自定义算子注册与社区对齐,删除utils/dtensor.py # 【资料变更】 不涉及 # 【接口变更】 不涉及 # 【功能验证】 适配UT测试用例看护 # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [ ] 代码注释完备,正确记录错误日志 - [ ] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [ ] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!3458924 天前
add sharded tensor npu method Co-authored-by: gcw_Dgfy2aKk<dujunjie2@huawei.com> # message auto-generated for no-merge-commit merge: !29226 merge shard_tensor_npu_patch_260107_2.7.1 into v2.7.1 add sharded tensor npu method Created-by: gcw_Dgfy2aKk Commit-by: gcw_Dgfy2aKk Merged-by: ascend-robot Description: <!-- Thanks for sending a pull request! --> **What type of PR is this?** > Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line: > > /kind bug > /kind task > /kind feature **What does this PR do / why do we need it**: **Which issue(s) this PR fixes**: <!-- *Automatically closes linked issue when PR is merged. Usage: Fixes #<issue number>, or Fixes (paste link of issue). --> Fixes # **Special notes for your reviewers**: See merge request: Ascend/pytorch!292264 个月前
refactor dtensor rules Co-authored-by: kisnwang<wangkaisheng2@huawei.com> # message auto-generated for no-merge-commit merge: !34589 merge v2.7.1 into v2.7.1 refactor dtensor rules Created-by: kisnwang Commit-by: kisnwang Merged-by: ascend-robot Description: <!-- PR描述模板更新日期:20260203 --> # 【合入来源】 > <font color="red">**如有社区issue,请关联issue链接**</font>\ > <font color="red">**请勿携带内部流程信息(需求链接、问题单、内部issue等)**</font> - [ ] 需求 - [ ] 问题单 - [ ] issue/工单 - [x] 重构优化 [#1743](https://gitcode.com/Ascend/pytorch/issues/1743) - [ ] 资料更新 # 【修改方案】 自定义算子注册与社区对齐,删除utils/dtensor.py # 【资料变更】 不涉及 # 【接口变更】 不涉及 # 【功能验证】 适配UT测试用例看护 # 【CheckList】 > PR提交人对以下CheckList自检项进行全量自检,自检通过或不涉及,均修改 [ ] 为 [x] - [ ] 代码注释完备,正确记录错误日志 - [ ] 代码实现进行了返回值、空指针等校验 - [x] PR标题正确使用类型标签,如:feat、fix、refactor、docs、test等 - [ ] PR持续集成流水线(CI)执行通过,代码检查无异常 See merge request: Ascend/pytorch!3458924 天前