ascend-robot【fix】完善小模型Atlas 300I DUO硬件描述

文件	最后提交记录	最后更新时间
README.md	【fix】完善小模型Atlas 300I DUO硬件描述 Co-authored-by: Niushiya<niushiya1@huawei.com> # message auto-generated for no-merge-commit merge: !7587 merge master into master 【fix】完善小模型Atlas 300I DUO硬件描述 Created-by: niushiya Commit-by: Niushiya Merged-by: ascend-robot Description: ## Motivation 1、完善小模型Atlas 300I DUO硬件描述，补充单芯字段； ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [x] The new code needs to comply with the Clean Code specification. - [x] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [x] CLA has been signed and all committers have signed the CLA in this PR. - [x] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7587	5 天前
image_demo_npu.py	!7159 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO NPU Infer 脚本 Merge pull request !7159 from 沈鑫杰/groundingdino2	1 年前
install_requirements.sh	!7141 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO Merge pull request !7141 from 沈鑫杰/groundingdino	1 年前
mmcv_diff.patch	!7141 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO Merge pull request !7141 from 沈鑫杰/groundingdino	1 年前
mmdetection_diff.patch	!7412 fix groundingdino and whisper Merge pull request !7412 from 邹亦舟/master	7 个月前
mmengine_diff.patch	!7141 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO Merge pull request !7141 from 沈鑫杰/groundingdino	1 年前
register_im2col_to_torchair.py	!7159 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO NPU Infer 脚本 Merge pull request !7159 from 沈鑫杰/groundingdino2	1 年前
register_roll_to_torchair.py	!7159 [Feat][ACL-PyTorch][cv][GroundingDINO] 新增GroundingDINO NPU Infer 脚本 Merge pull request !7159 from 沈鑫杰/groundingdino2	1 年前
requirements.txt	!7412 fix groundingdino and whisper Merge pull request !7412 from 邹亦舟/master	7 个月前
video_demo_npu.py	!7355 modify the readme and error description Merge pull request !7355 from 邹亦舟/master	9 个月前

GroundingDINO(TorchAir)-推理指导

概述
推理环境准备
快速上手

概述

‌GroundingDINO 是一种最先进的开放集检测模型，可解决多项视觉任务，包括开放词汇检测(OVD)、Phrase Grounding(PG)、和指代性表达式理解(REC)。它的有效性已使其被广泛采用，成为各种下游应用的主流架构。

版本说明：

url=https://github.com/open-mmlab/mmdetection
commit_id=cfd5d3a9
model_name=MM-GroundingDINO

推理环境准备

该模型需要以下插件与驱动
表 1 版本配套表

配套	版本	环境准备指导
固件与驱动	25.2.RC1	Pytorch框架推理环境准备
CANN	8.2.RC1	包含kernels包和toolkit包
Python	3.11	-
PyTorch	2.1.0	-
Ascend Extension PyTorch	2.1.0.post13	-
说明：Atlas 800I A2 推理卡和Atlas 300I DUO 推理卡请以CANN版本选择实际固件与驱动版本。目前Atlas 300I DUO仅支持推理图像。	\	\

快速上手

获取源码

获取本仓源码

git clone https://gitcode.com/ascend/ModelZoo-PyTorch.git
cd ModelZoo-PyTorch/ACL_PyTorch/built-in/cv/GroundingDINO

获取开源模型源码

git clone https://github.com/open-mmlab/mmdetection
cd mmdetection
git reset --hard cfd5d3a9

下载相关权重和图片
- 下载BERT权重，并放置于mmdetection目录下
- 下载MM-GroundingDINO权重，并放置于weights目录下
- 下载animals.png，并放置于images目录下
- 下载NLTK权重（可选）。MM-GroundingDINO在进行Phrase Grounding推理时可能会进行名词短语提取，虽然会在运行时下载特定的模型，但是考虑到有些用户运行环境无法联网，因此可以提前下载。
  - 方式一：下载模型到~/nltk_data路径下。
  - 方式二：采用如下命令下载
```
import nltk
nltk.download('punkt', download_dir='~/nltk_data')
nltk.download('averaged_perceptron_tagger', download_dir='~/nltk_data')
```
  下载完成后目录大致为
```
root/nltk_data/
├── taggers
│   └── averaged_perceptron_tagger
│       └── averaged_perceptron_tagger.pickle
└── tokenizers
    └── punkt
        ├── PY3
        │   ├── ...
        │
        ├── ...
```

本地下载完成后的目录树如下，检查依赖项和脚本是否归档正确。

 mmdetection
 ├── demo
 │   ├── image_demo.py
 │   ├── video_demo.py
 │   ├── image_demo_npu.py //本仓提供单图推理脚本
 │   ├── video_demo_npu.py //本仓提供视频推理脚本
 │   ├── register_im2col_to_torchair.py //本仓提供torchair算子注册文件
 │   ├── register_roll_to_torchair.py //本仓提供torchair算子注册文件
 │   ├── requirements.txt //本仓提供
 │   └── install_requirements.sh //本仓提供依赖一键安装脚本
 ├── diff_patch
 │   ├── mmdetection_diff.patch  //本仓提供
 │   ├── mmengine_diff.patch     //本仓提供
 │   └── mmcv_diff.patch         //本仓提供
 ├── bert-base-uncased //BERT权重
 ├── weights
 │   └── grounding_dino_swin-b_pretrain_obj365_goldg_v3de-f83eef00.pth //MM-GroundingDINO权重
 ├── images
 │   └── animals.png
 ├── mmdet
 ├── resources
 ├── README.md
 ├── tests
 ├── tools
 ├── config
 └── ...

安装依赖

conda create -n groundingdino python=3.8
conda activate groundingdino

#在mmdetection目录下执行依赖一键安装脚本，会在自动拉取依赖仓并应用diff patch文件
source demo/install_requirements.sh

模型推理

1 开始推理验证

设置环境变量，执行推理命令

# 指定使用NPU ID，默认为0
export ASCEND_RT_VISIBLE_DEVICES=0

# 执行图片推理命令
python demo/image_demo_npu.py images/animals.png configs/mm_grounding_dino/grounding_dino_swin-b_pretrain_obj365_goldg_v3det.py --weight weights/grounding_dino_swin-b_pretrain_obj365_goldg_v3de-f83eef00.pth --texts '$: coco' --device npu (--loop 10)

# 执行视频推理命令
python demo/video_demo_npu.py demo/demo.mp4 configs/mm_grounding_dino/grounding_dino_swin-b_pretrain_obj365_goldg_v3det.py weights/grounding_dino_swin-b_pretrain_obj365_goldg_v3de-f83eef00.pth (--batch_size 16)

在推理开始后，首先会默认执行warm_up，目的是执行首次编译，首次编译时间较长，在warm_up结束后，会执行推理操作，并打屏计算结果和性能数据。

2 性能

单图性能，以animals.png为例，测得以下数据：

模型芯片 E2E forward

MM-GroundingDINO Atlas 800I A2 877ms 274ms

MM-GroundingDINO Atlas 300I DUO(单芯) 1378ms 740ms
- forward性能在mmdet/apis/det_inferencer.py中的preds = self.forward(data, **forward_kwargs)前后打点得到。
视频性能：

模型芯片视频分辨率 batch_size forward

MM-GroundingDINO Atlas 800I A2 1080P 1bs 140ms

MM-GroundingDINO Atlas 800I A2 1080P 16bs 124ms
- forward性能从demo/video_demo_npu.py执行结束打屏信息获取，对应单针耗时per frame infer time
- config配置文件configs\mm_grounding_dino\grounding_dino_swin-t_pretrain_obj365.py中的scale=(800, 1333)控制图片缩放，减小缩放比例可提升模型性能，已经测试coco数据集精度在(800, 1152)配置下精度mAP为0.522，(800, 1024)配置下精度mAP为0.52，具体配置选择由用户自行评估。

模型	芯片	E2E	forward
MM-GroundingDINO	Atlas 800I A2	877ms	274ms
MM-GroundingDINO	Atlas 300I DUO(单芯)	1378ms	740ms

模型	芯片	视频分辨率	batch_size	forward
MM-GroundingDINO	Atlas 800I A2	1080P	1bs	140ms
MM-GroundingDINO	Atlas 800I A2	1080P	16bs	124ms

FAQ

mmcv源码安装报错参考：https://mmcv.readthedocs.io/zh-cn/2.x/faq.html