torchair/third_party · Ascend/torchair - AtomGit

ascend-robotfeat: AlltoAllMatmul AICPU适配新增通信引擎参数comm_mode

1a2031b5创建于 4 天前历史提交

文件	最后提交记录	最后更新时间
ascend	feat: AlltoAllMatmul AICPU适配新增通信引擎参数comm_mode Co-authored-by: tangjn<tangjianing@huawei.com> # message auto-generated for no-merge-commit merge: !3118 merge master into master feat: AlltoAllMatmul AICPU适配新增通信引擎参数comm_mode Created-by: tangjn Commit-by: tangjn Merged-by: ascend-robot Description: feat: 新增通信引擎参数为了在AlltoAllMatmul和AlltoAllQuantMatmul算子中，支持用户指定HCCL通信引擎（AICPU或CCU），为算子PTA接口新增一个可选入参comm_mode。 [#2777](https://gitcode.com/cann/ops-transformer/issues/2777) See merge request: Ascend/torchair!3118	4 天前
torch_npu	add npu_format_cast_via_cpu + add quantmatmul lite converter # message auto-generated for no-merge-commit merge: !3092 merge master into master add npu_format_cast_via_cpu + add quantmatmul lite converter Created-by: zihan0007 Commit-by: zihan0007 Merged-by: ascend-robot Description: 合入来源： torchair 新增npu_format_cast_via_cpu接口：新增npu_format_cast_via_cpu接口支持npuTensor在cpu将数据排布从Nd转为Nz，当前支持int8/uint8 Nd输入转为Fractal_nz输出 ge_pass增加QuantBatchMatmulV3前的Bitcast和Transpose算子互换位置的能力，方便后续Transpose融合增加lite converter支持QuantBatchMatmulV3 A8W4（int4）场景的converter逻辑 See merge request: Ascend/torchair!3092	7 天前
googletest	!1409 update gtest Merge pull request !1409 from 关龙锋/gtest	1 年前
secure_c	!1366 add submodule Merge pull request !1366 from 关龙锋/submodule	1 年前