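Swin-Transformer-Semantic-Segmentation Model: Inference Guide

Overview

Transformers perform very well in NLP, so how can they be carried over to CV? The challenge comes from differences between the two domains in scale and resolution. In NLP tasks every word embedding has a fixed dimension, whereas in CV tasks the scale of the image content varies widely; and compared with the number of words in a passage of text, the pixel resolution of an image is much higher. To address these issues, the authors propose a hierarchical Transformer that uses shifted windows to restrict self-attention to non-overlapping local windows while still allowing cross-window connections, which improves efficiency. This hierarchical architecture can model flexibly at various scales, and its computational complexity is only linear in the image size. These properties make Swin Transformer compatible with a broad range of CV tasks, including image classification and dense prediction tasks such as object detection and semantic segmentation. Its strong results on these tasks suggest that Swin Transformer can serve as a general-purpose backbone for CV.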
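
The linear-complexity claim can be illustrated with the two cost formulas given in the Swin Transformer paper: global self-attention costs 4hwC^2 + 2(hw)^2 C, while window attention with M x M windows costs 4hwC^2 + 2 M^2 hw C. The sketch below is only an illustration. The values C = 96 and M = 7 are assumptions read off the model name (small variant, window7), and padding of feature maps whose side is not a multiple of 7 is ignored.

```python
def msa_cost(h, w, c):
    """Global multi-head self-attention: 4*h*w*C^2 + 2*(h*w)^2*C."""
    return 4 * h * w * c ** 2 + 2 * (h * w) ** 2 * c


def wmsa_cost(h, w, c, m):
    """Window attention with m x m windows: 4*h*w*C^2 + 2*m^2*h*w*C."""
    return 4 * h * w * c ** 2 + 2 * m ** 2 * h * w * c


C, M = 96, 7  # assumed channel width and window size
for side in (64, 128, 256):  # 128 = 512 / 4 tokens per side after patch4
    print(side, msa_cost(side, side, C), wmsa_cost(side, side, C, M))
```

Doubling the side length multiplies the token count by 4. The window-attention cost grows by exactly that factor, while the global-attention cost grows faster because of its quadratic term.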

Reference implementation:

url = https://github.com/SwinTransformer/Swin-Transformer-Semantic-Segmentation
commit_id = 87e6f90577435c94f3e92c7db1d36edc234d91f6
model_name = upernet_swin_small_patch4_window7_512x512

Input and output data

Input data

Input	Data type	Size	Layout
input	RGB_FP32	batchsize x 3 x 512 x 512	NCHW

Output data

Output	Data type	Size	Layout
output	FLOAT32	batchsize x 150 x 512 x 512	ND
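
To make the shapes above concrete, here is a small sketch that computes how many bytes one input and one output tensor occupy. It assumes 4 bytes per element for both FP32 tensors; the helper name `tensor_bytes` is made up for this example.

```python
FLOAT32_BYTES = 4  # assumed element size for RGB_FP32 / FLOAT32


def tensor_bytes(shape, itemsize=FLOAT32_BYTES):
    """Number of bytes needed to store a dense tensor of the given shape."""
    count = 1
    for dim in shape:
        count *= dim
    return count * itemsize


for bs in (1, 4):
    inp = tensor_bytes((bs, 3, 512, 512))
    out = tensor_bytes((bs, 150, 512, 512))
    print("batchsize", bs, "input MiB", inp / 2 ** 20, "output MiB", out / 2 ** 20)
```

The output is 50 times larger than the input (150 class channels versus 3 colour channels), so the disk space taken by saved inference results is dominated by the output tensors.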

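Inference environment setup

This model requires the following firmware, drivers, and software. The version compatibility table is shown below:

Component	Version	Setup guide
Firmware and driver	1.0.15	PyTorch framework inference environment setup
CANN	5.1.RC2	-
Python	3.7.5	-
PyTorch	1.7.1	-
Note: for the Atlas 300I Duo inference card, choose the firmware and driver versions that match your CANN version.	\	\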

Quick start

Get the source code

Get the source code.

git clone https://github.com/SwinTransformer/Swin-Transformer-Semantic-Segmentation.git   # clone the repository
cd Swin-Transformer-Semantic-Segmentation   # switch to the model repository directory
git reset --hard 87e6f90577435c94f3e92c7db1d36edc234d91f6   # reset the code to the referenced commit_id
patch -p1 < ../change.patch   # patch the source code

Install dependencies.
```
pip install -r requirements.txt
```

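Prepare the dataset

Get the raw dataset.

This inference project uses the 2000 validation images of ADE20K to verify model accuracy. Download the dataset from the official ADE20K website (registration is required). Create a data folder inside the Swin-Transformer-Semantic-Segmentation directory and place the ADE dataset in it. In the end, the original validation images and the annotation images are stored under images/validation and annotations/validation respectively. The directory structure is as follows: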
```
├── data/ade/ADEChallengeData2016/
    ├── annotations/
        ├── validation/
            ├── ADE_val_00000001.png
            ├── ...
            ├── ADE_val_00002000.png
    ├── images/
        ├── validation/
            ├── ADE_val_00000001.jpg
            ├── ...
            ├── ADE_val_00002000.jpg
```
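
Before preprocessing, it can help to confirm that every validation image has a matching annotation. The following sketch assumes the layout shown above; the function name `find_unpaired` is made up for this example and is not part of the repository.

```python
from pathlib import Path


def find_unpaired(root):
    """Return (images without annotation, annotations without image) by file stem."""
    root = Path(root)
    images = {p.stem for p in (root / "images" / "validation").glob("*.jpg")}
    labels = {p.stem for p in (root / "annotations" / "validation").glob("*.png")}
    return sorted(images - labels), sorted(labels - images)


# Path taken from the directory tree above, relative to the model repository.
no_label, no_image = find_unpaired("data/ade/ADEChallengeData2016")
print("images without annotation:", len(no_label))
print("annotations without image:", len(no_image))
```

Both counts should be 0 for a complete download. A missing directory also yields 0, so check that the folders exist as well.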
Preprocess the data to convert the raw dataset into model input data.

Use the mv command to move the preprocessing script Swin-Transformer-Semantic-Segmentation_preprocess.py into the Swin-Transformer-Semantic-Segmentation directory, switch to that directory, and run the script to complete preprocessing.
```
cd ..
mv Swin-Transformer-Semantic-Segmentation_preprocess.py Swin-Transformer-Semantic-Segmentation/
cd Swin-Transformer-Semantic-Segmentation

python Swin-Transformer-Semantic-Segmentation_preprocess.py --config configs/swin/upernet_swin_small_patch4_window7_512x512_160k_ade20k.py --save-dir data/bin/
```
Parameters:
- --config: path to the model configuration file
- --save-dir: path to the directory where the generated bin files are stored
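
As a rough illustration of what "converting an image into a bin file" means, the sketch below normalizes RGB pixels and reorders them from HWC to the channel-first layout the model expects. It is not the repository's preprocessing script, which takes its pipeline from the config file. The mean and std values are assumptions (the RGB statistics commonly used in mmsegmentation configs), resizing to 512 x 512 is omitted, and a real script would normally use NumPy rather than the standard-library `array` module.

```python
from array import array

# Assumed normalization constants; check the config file for the real values.
MEAN = (123.675, 116.28, 103.53)
STD = (58.395, 57.12, 57.375)


def to_chw_float32(pixels_hwc, height, width):
    """Normalize a flat HWC list of RGB values and return them in CHW order."""
    out = array("f")  # 32-bit floats
    for c in range(3):  # channel-major order: all R, then all G, then all B
        for i in range(height * width):
            out.append((pixels_hwc[i * 3 + c] - MEAN[c]) / STD[c])
    return out


tiny = [0, 0, 0, 255, 255, 255]  # a 1 x 2 image: one black and one white pixel
blob = to_chw_float32(tiny, 1, 2)
print(len(blob), "values,", len(blob.tobytes()), "bytes")
```

Writing `blob.tobytes()` to a file would give a raw float32 buffer; for a full 3 x 512 x 512 image that buffer holds 786432 values.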

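Model inference

Model conversion.

Use PyTorch to convert the .pth weight file into an .onnx file, then use the ATC tool to convert the .onnx file into an offline inference model (.om file).

Get the weight file.

Get the pretrained .pth weight file. After downloading, place it in the Swin-Transformer-Semantic-Segmentation/checkpoint directory.
Export the ONNX file.
1. Use Swin-Transformer-Semantic-Segmentation_pth2onnx.py to export the ONNX file.
  
  Use the mv command to move the Swin-Transformer-Semantic-Segmentation_pth2onnx.py script into the Swin-Transformer-Semantic-Segmentation directory, then run it.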
```
cd ..
mv Swin-Transformer-Semantic-Segmentation_pth2onnx.py Swin-Transformer-Semantic-Segmentation/
cd Swin-Transformer-Semantic-Segmentation

python Swin-Transformer-Semantic-Segmentation_pth2onnx.py --config configs/swin/upernet_swin_small_patch4_window7_512x512_160k_ade20k.py --checkpoint checkpoint/upernet_swin_small_patch4_window7_512x512.pth --onnx swin_bs${bs}.onnx --batchsize ${bs}
```
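  Parameters:
  - --config: path to the model configuration file
  - --checkpoint: path to the pretrained weights
  - --onnx: path where the generated ONNX model is saved
  - --batchsize: batch size of the model input, default 1
  - --opset-version: ONNX opset version, default 11
  After the script finishes, the .onnx file is generated in the Swin-Transformer-Semantic-Segmentation directory.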

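Use the ATC tool to convert the ONNX model into an OM model.

Set the environment variables.

source /usr/local/Ascend/ascend-toolkit/set_env.sh

Run the following command to check the chip name.

npu-smi info
# The chip name of this device is Ascend310P3
The command output looks like this: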
+--------------------------------------------------------------------------------------------+
| npu-smi 22.0.0                       Version: 22.0.2                                       |
+-------------------+-----------------+------------------------------------------------------+
| NPU     Name      | Health          | Power(W)     Temp(C)           Hugepages-Usage(page) |
| Chip    Device    | Bus-Id          | AICore(%)    Memory-Usage(MB)                        |
+===================+=================+======================================================+
| 0       310P3     | OK              | 16.6         56                0    / 0              |
| 0       0         | 0000:5E:00.0    | 0            935  / 21534                            |
+===================+=================+======================================================+
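
The chip name can also be read out of this table programmatically. The sketch below is a minimal example that assumes the table layout shown above, where an NPU row starts with an index followed by the name and a Health cell. `SAMPLE` repeats three rows of that output, and `parse_chip_names` is a made-up helper name.

```python
import re

# Three rows copied from the npu-smi output shown above.
SAMPLE = """\
| NPU     Name      | Health          | Power(W)     Temp(C)           Hugepages-Usage(page) |
| 0       310P3     | OK              | 16.6         56                0    / 0              |
| 0       0         | 0000:5E:00.0    | 0            935  / 21534                            |
"""

# NPU rows: "| <index> <name> | <health> |". Chip rows are skipped because
# their second cell is a Bus-Id, which contains colons.
NPU_ROW = re.compile(r"^\|\s*\d+\s+(\S+)\s*\|\s*[^|:]+?\s*\|")


def parse_chip_names(text):
    """Return the Name column of every NPU row."""
    names = []
    for line in text.splitlines():
        match = NPU_ROW.match(line)
        if match:
            names.append(match.group(1))
    return names


for name in parse_chip_names(SAMPLE):
    print("Ascend" + name)
```

On a real machine the text would come from running `npu-smi info`, for example through `subprocess.run`. Other driver versions may print a different layout, so treat the pattern as a starting point.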

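Run the ATC command.
```
atc --framework=5 --model=swin_bs${bs}.onnx --output=swin_bs${bs} --input_format=NCHW --input_shape="input:${bs},3,512,512" --log=null --soc_version=Ascend${chip_name}
```
- Parameters:
  - --model: the ONNX model file.
  - --framework: 5 means an ONNX model.
  - --input_shape: shape of the input data.
  - --input_format: layout of the input data.
  - --output: the output OM model.
  - --log: log level.
  - --soc_version: processor model, that is, Ascend followed by the chip name reported by npu-smi info (Ascend310P3 for the device above).
  After the command finishes, the .om file is generated in the Swin-Transformer-Semantic-Segmentation directory.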
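
The export and conversion commands both depend on the batch size, so it can be convenient to generate them together. The sketch below only builds the two command lines from the flags documented in this guide. It assumes that `chip_name` is the full name reported by npu-smi (310P3 above), so that the processor model reads Ascend310P3; `build_commands` is a made-up helper name.

```python
import shlex

CONFIG = "configs/swin/upernet_swin_small_patch4_window7_512x512_160k_ade20k.py"
CHECKPOINT = "checkpoint/upernet_swin_small_patch4_window7_512x512.pth"


def build_commands(bs, chip_name):
    """Return the (pth2onnx, atc) argument lists for one batch size."""
    stem = "swin_bs{}".format(bs)
    export = [
        "python", "Swin-Transformer-Semantic-Segmentation_pth2onnx.py",
        "--config", CONFIG, "--checkpoint", CHECKPOINT,
        "--onnx", stem + ".onnx", "--batchsize", str(bs),
    ]
    atc = [
        "atc", "--framework=5", "--model=" + stem + ".onnx",
        "--output=" + stem, "--input_format=NCHW",
        "--input_shape=input:{},3,512,512".format(bs),
        "--log=null", "--soc_version=Ascend" + chip_name,
    ]
    return export, atc


for cmd in build_commands(bs=1, chip_name="310P3"):
    print(" ".join(shlex.quote(part) for part in cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` from inside the Swin-Transformer-Semantic-Segmentation directory once the environment variables are set.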

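Run inference and validation.
1. Install the ais_bench inference tool. Visit the ais_bench inference tool repository and follow its README to install the tool.
```
mkdir infer   # create the folder that stores the inference results
```
2. Run inference.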
```
python -m ais_bench --model swin_bs${bs}.om --input data/bin/ --output infer/ --batchsize ${bs}
```
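  Parameters:
  - --model: path to the OM model
  - --input: path to the directory containing the preprocessed bin files
  - --output: path to the directory where inference results are stored
  After a successful run, the inference results are written to a subfolder of infer named after the date and time of the run.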
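
Because each run creates a new timestamped subfolder, a small helper can pick the most recent one for the accuracy step. The sketch below selects by modification time rather than by name, since the exact folder-name format is not specified here; `latest_result_dir` is a made-up helper name.

```python
from pathlib import Path


def latest_result_dir(output_root="infer"):
    """Return the most recently modified subfolder of output_root, or None."""
    root = Path(output_root)
    if not root.is_dir():
        return None
    subdirs = [p for p in root.iterdir() if p.is_dir()]
    if not subdirs:
        return None
    return max(subdirs, key=lambda p: p.stat().st_mtime)


print(latest_result_dir("infer"))
```

The returned path can be passed to the postprocessing script as the inference results directory.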
3. Accuracy validation.
  
  Use the mv command to move the Swin-Transformer-Semantic-Segmentation_postprocess.py script into the Swin-Transformer-Semantic-Segmentation directory. Running the script produces the mIoU accuracy figure.
```
cd ..
mv Swin-Transformer-Semantic-Segmentation_postprocess.py Swin-Transformer-Semantic-Segmentation/
cd Swin-Transformer-Semantic-Segmentation

python Swin-Transformer-Semantic-Segmentation_postprocess.py --config configs/swin/upernet_swin_small_patch4_window7_512x512_160k_ade20k.py --infer-results infer/${infer_result_datalog}/
```
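  Parameters:
  - --config: path to the model configuration file
  - --infer-results: path to the directory containing the inference results
  ${infer_result_datalog} is the name of the folder that holds the inference results. After a successful run, the program prints the model's mIoU accuracy.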
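
For readers unfamiliar with the metric, the sketch below shows how mIoU is defined: per-class intersection over union, averaged over the classes that occur. It is a simplified illustration and not the postprocessing script. It works on flat lists of class indices (in practice obtained by taking the argmax over the 150 output channels), and the `ignore_index` value of 255 is an assumption.

```python
def mean_iou(pred, label, num_classes, ignore_index=255):
    """Mean intersection-over-union over classes that appear in pred or label."""
    inter = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, label):
        if t == ignore_index:  # unlabeled pixels do not count
            continue
        if p == t:
            inter[t] += 1
            union[t] += 1
        else:  # a miss enlarges the union of both classes involved
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


pred = [0, 0, 1, 1]
label = [0, 1, 1, 1]
print(mean_iou(pred, label, num_classes=2))
```

In this toy case class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is their average. For a dataset-level figure, the intersection and union counts are usually accumulated over all images before dividing.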
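4. Performance validation.
  
  The pure inference mode of the ais_bench tool can be used to measure the performance of OM models with different batch sizes. A reference command is shown below: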
```
python -m ais_bench --model swin_bs${bs}.om --loop 100 --batchsize ${bs}
```
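  Parameters:
  - --model: path to the OM model
  - --input: path to the directory containing the preprocessed bin files (omitted in pure inference mode)
  - --loop: number of inference iterations, optional, default 1
  - --batchsize: batch size the OM model was converted with, default 1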

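Model inference performance & accuracy

Performance comparison

On a 300I Pro device, the model reaches 19.24 fps with a batch size of 1.

Chip model	Batch size	Dataset	Accuracy (mIoU)	Performance
300I Pro	1	ADE20K	48.06%	19.24 fps

Note: with a batch size of 4 or higher, inference fails because of insufficient memory, so no accuracy or performance data is available.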
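
The throughput figure can be turned into a per-image latency and a rough run time for the whole validation set. The sketch below is plain arithmetic on the numbers reported above. It assumes that fps means images per second of model execution only, so preprocessing, postprocessing, and file I/O are not included.

```python
FPS_BS1 = 19.24        # reported throughput at batch size 1
NUM_VAL_IMAGES = 2000  # size of the ADE20K validation set used here

latency_ms = 1000.0 / FPS_BS1      # time per image in milliseconds
total_s = NUM_VAL_IMAGES / FPS_BS1  # time for one pass over the validation set

print("{:.2f} ms per image".format(latency_ms))
print("{:.1f} s for {} images".format(total_s, NUM_VAL_IMAGES))
```

This comes to roughly 52 ms per image and a little under two minutes of pure model time for the full validation set.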