MindSpeed/mindspeed/te/pytorch/module · Ascend/MindSpeed - AtomGit

文件	最后提交记录	最后更新时间
ops	feat(fp8): quant mm add Co-authored-by: Muu<koimuu@163.com> # message auto-generated for no-merge-commit merge: !3272 merge feature_quant_mm_add into master feat(fp8): quant mm add Created-by: Muuyo Commit-by: Muu Merged-by: ascend-robot Description: 1. mxfp8 策略支持 npu_add_quant_matmul 算子 2. tensorwise 支持mc2策略 3. 修复在backward时取FP8Tensor2D device 取不到的问题自测报告: https://wiki.huawei.com/domains/76578/wiki/233229/WIKI2026030210253506 See merge request: Ascend/MindSpeed!3272	2 个月前
__init__.py	!2791 [feat!!!]te support v2 Merge pull request !2791 from yangjie/master	8 个月前
checkpoint.py	feature(fp8): te checkpoint Co-authored-by: Muu<koimuu@163.com> # message auto-generated for no-merge-commit merge: !3162 merge feature_checkpoint into master feature(fp8): te checkpoint Created-by: Muuyo Commit-by: Muu Merged-by: ascend-robot Description: 1. 引入 te checkpoint消除重计算中冗余的量化操作 2. refactor(blockwise): 删除128128的blockwise策略, 保留1 128\|128 * 128策略替换 3. perf(hif8): 删除多余的cast 4. fix(delayed): 修复delayed算法 5. refactor(recipe 2x): 重构blockwise和mxfp8策略数据存取, 简化后续算子适配 6. 消除字符串字面量, 采用枚举替代验证报告: https://wiki.huawei.com/domains/76578/wiki/233229/WIKI202601139775970 See merge request: Ascend/MindSpeed!3162	4 个月前
grouped_linear.py	feat: fp8_reuse_quant_w Co-authored-by: Jia_Austin<dengjia6@huawei.com> # message auto-generated for no-merge-commit merge: !3358 merge feat_fp8_reuse_quant_w into master feat: fp8_reuse_quant_w Created-by: Jia_Austin Commit-by: Jia_Austin Merged-by: ascend-robot Description: What this PR does / why we need it? Please describe the background and detailed changes of the PR. If it is a bugfix, please attach the related issue. Does this PR introduce any user-facing change? Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path. How was this patch tested? Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations. See merge request: Ascend/MindSpeed!3358	2 个月前
layernorm.py	!2715 adjust LayerNorm patch Merge pull request !2715 from wangyuansheng8/master	9 个月前
layernorm_column_parallel_linear.py	feature(fp8): te checkpoint Co-authored-by: Muu<koimuu@163.com> # message auto-generated for no-merge-commit merge: !3162 merge feature_checkpoint into master feature(fp8): te checkpoint Created-by: Muuyo Commit-by: Muu Merged-by: ascend-robot Description: 1. 引入 te checkpoint消除重计算中冗余的量化操作 2. refactor(blockwise): 删除128128的blockwise策略, 保留1 128\|128 * 128策略替换 3. perf(hif8): 删除多余的cast 4. fix(delayed): 修复delayed算法 5. refactor(recipe 2x): 重构blockwise和mxfp8策略数据存取, 简化后续算子适配 6. 消除字符串字面量, 采用枚举替代验证报告: https://wiki.huawei.com/domains/76578/wiki/233229/WIKI202601139775970 See merge request: Ascend/MindSpeed!3162	4 个月前
linear.py	feat(fp8): quant mm add Co-authored-by: Muu<koimuu@163.com> # message auto-generated for no-merge-commit merge: !3272 merge feature_quant_mm_add into master feat(fp8): quant mm add Created-by: Muuyo Commit-by: Muu Merged-by: ascend-robot Description: 1. mxfp8 策略支持 npu_add_quant_matmul 算子 2. tensorwise 支持mc2策略 3. 修复在backward时取FP8Tensor2D device 取不到的问题自测报告: https://wiki.huawei.com/domains/76578/wiki/233229/WIKI2026030210253506 See merge request: Ascend/MindSpeed!3272	2 个月前
performance_grouped_linear.py	feat:adaptor te gmm tensor paramter initialization Co-authored-by: lizhi<lizhi166@huawei.com> # message auto-generated for no-merge-commit merge: !3172 merge master into master feat:adaptor te gmm tensor paramter initialization Created-by: liz_hw Commit-by: lizhi Merged-by: ascend-robot Description: feat:高性能分支参数初始化对齐local 验证文档：https://wiki.huawei.com/domains/159101/wiki/324516/WIKI202601099745044 See merge request: Ascend/MindSpeed!3172	4 个月前