| fix:fix atten_mask_shape error when using transformer_engine
Co-authored-by: Keilo_W<wangkaiyu11@h-partners.com>
# message auto-generated for no-merge-commit merge:
!3293 merge master into master
fix:fix atten_mask_shape error when using transformer_engine
Created-by: Keilo_W
Commit-by: Keilo_W
Merged-by: ascend-robot
Description: An atten_mask_shape error will occur if --attention-mask-type causal is used together with --transformer-impl transformer_engine. To avoid this, you must also enable the --use-flash-attn option.
See merge request: Ascend/MindSpeed!3293 | 2 个月前 |