| feat: fp8 reuse quant w with te_gmm_mode compatible
Co-authored-by: Jia_Austin<dengjia6@huawei.com>
# message auto-generated for no-merge-commit merge:
!3371 merge fp8_reuse_perf_v2 into master
feat: fp8 reuse quant w with te_gmm_mode compatible
Created-by: Jia_Austin
Commit-by: Jia_Austin
Merged-by: ascend-robot
Description: What this PR does / why we need it?
feat: fp8 reuse quant w with te_gmm_mode compatible; perf/fix: fp8 reuse quant w with te_gmm_mode perf
Does this PR introduce any user-facing change?
Please describe whether the PR will result in any user-facing usage changes. If there is related documentation, please specify its path.
How was this patch tested?
Please explain how to verify the correctness and effectiveness of this feature, as well as its usage constraints and limitations.
See merge request: Ascend/MindSpeed!3371 | 1 个月前 |