文件最后提交记录最后更新时间
init 4 年前
!4671 【fix】批量修改模型python版本,兼容环境上的python3.8版本 * fix python version 3 年前
!2025 pytorch批量环境变量整改 * 删除无关文件 * 修改环境变量 * 修改环境变量 3 年前
init 4 年前
init 4 年前
!6807 【资料整改】README中产品形态整改 Merge pull request !6807 from 何昕彦/master 1 年前
fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist **Before PR**: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized **After PR**: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!75171 个月前
init 4 年前
init 4 年前
init 4 年前
!5854 Network address of models to be rectified: 28 Merge pull request !5854 from Yss/network_declaration_28 2 年前
[众智][PyTorch]整改模型中的requirements.txt文件,删除torch,apex Signed-off-by: bailang <bailang12@h-partners.com> 3 年前
init 4 年前
!574 [众筹][PyTorch] 优化缩短TNT模型1p性能训练时间 !574 [众筹][PyTorch] 优化缩短TNT模型1p性能训练时间 3 年前
README.md

TNT

This implements training of TNT on the ImageNet dataset, mainly modified from CV-Backbones/tnt_pytorch.

TNT Detail

TNT divides the image patches into sub-patches. With the structure of visual sentences and visual words, TNT can better perform the image classification task.

Requirements

  • Install PyTorch (pytorch.org)
  • pip install -r requirements.txt Note:Install the torchvision that corresponds to the torch version
  • Download the ImageNet dataset
  • Please add the following shapes into /usr/local/Ascend/ascend-toolkit/5.0.3/arm64-linux/opp/op_impl/built-in/ai_core/tbe/impl/dynamic/transpose.py for better performance.
    • [2, 25088, 4, 16, 6], [25088, 16, 2, 4, 6]

Training

To train a model, run main.py with the desired model architecture and the path to the ImageNet dataset:

# training 1p accuracy
bash ./test/train_full_1p.sh --data_path=real_data_path

# training 1p performance
bash ./test/train_performance_1p.sh --data_path=real_data_path

# training 8p accuracy
bash ./test/train_full_8p.sh --data_path=real_data_path

# training 8p performance
bash ./test/train_performance_8p.sh --data_path=real_data_path

#test 8p accuracy
bash test/train_eval_8p.sh --data_path=real_data_path --pth_path=real_pre_train_model_path

# finetuning 1p 
bash test/train_finetune_1p.sh --data_path=real_data_path --pth_path=real_pre_train_model_path

Log path: test/output/devie_id/train_${device_id}.log # training detail log test/output/devie_id/WideReesnet50_2_bs8192_8p_perf.log # 8p training performance result log test/output/devie_id/WideReesnet50_2_bs8192_8p_acc.log # 8p training accuracy result log

TNT training result

Acc@1 FPS Npu_nums Epochs AMP_Type
- 57 1 1 O1
71.4% 410 8 79 O1

自测结果说明:

  • 项目交付精度要求:Acc@1 81.5%,目前在gpu中Acc@1为81.2%,已达目标精度的99%以上。

由于跑完310个epoch训练时间过长,本项目中测试npu上79个epoch后Acc@1为71.4%,此时gpu的Acc@1为71.7%,Atlas(NPU)精度大于竞品V(GPU)精度的99%(即71.7% * 99% = 71.0%),说明精度已经对齐,符合验收标准。

Statement

For details about the public address of the code in this repository, you can get from the file public_address_statement.md