| [pytorch][feature] Replace reset_position_ids with reset_attention_mask, and enable ring attention support when reset_attention_mask is active.
Co-authored-by: mhh001<mahonghao1@huawei.com>
# message auto-generated for no-merge-commit merge:
!3506 merge master into master
[pytorch][feature] Replace reset_position_ids with reset_attention_mask, and enable ring attention support when reset_attention_mask is active.
Created-by: mhh111
Commit-by: mhh001
Merged-by: ascend-robot
Description: Support ring CP in pack/neat-pack scenarios
Align reset-attention-mask with Megatron
Support fixed-length scenarios
See merge request: Ascend/MindSpeed-LLM!3506 | 6 months ago |