
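Whisper (TorchAir) Inference Guide

Overview

Whisper is OpenAI's open-source general-purpose speech recognition model. It supports multilingual transcription and translation, is built on the Transformer architecture, and suits scenarios such as meeting minutes and subtitle generation. This guide compiles the model with torchair for acceleration. It follows the whisperx inference flow, uses the funasr VAD model to split speech into segments, and uses the batching capability of the transformers pipeline. Supported models are whisper-base, whisper-large-v3, and whisper-large-v3-turbo.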
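
The split-then-batch flow described above can be illustrated with a small sketch. It assumes VAD output is a list of (start, end) times in seconds and that merged chunks should stay within Whisper's 30-second window. The function names and the greedy merge rule are hypothetical and are not taken from pipeline.py.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start_s, end_s) of one speech region


def merge_segments(segments: List[Segment], max_len: float = 30.0) -> List[Segment]:
    """Greedily merge consecutive VAD segments while the merged span stays <= max_len."""
    merged: List[Segment] = []
    for start, end in segments:
        if merged and end - merged[-1][0] <= max_len:
            # Extend the current chunk so that it also covers this segment.
            merged[-1] = (merged[-1][0], end)
        else:
            # Start a new chunk (a single segment longer than max_len is kept as-is).
            merged.append((start, end))
    return merged


def make_batches(chunks: List[Segment], batch_size: int) -> List[List[Segment]]:
    """Group chunks into consecutive batches of at most batch_size items."""
    return [chunks[i:i + batch_size] for i in range(0, len(chunks), batch_size)]


vad_output = [(0.0, 8.0), (9.0, 20.0), (21.0, 35.0), (36.0, 50.0), (70.0, 75.0)]
chunks = merge_segments(vad_output)
batches = make_batches(chunks, batch_size=2)
print(chunks)   # [(0.0, 20.0), (21.0, 50.0), (70.0, 75.0)]
print(batches)  # [[(0.0, 20.0), (21.0, 50.0)], [(70.0, 75.0)]]
```

Merging short segments into fewer, longer chunks reduces the number of model invocations, and batching lets several chunks share one forward pass.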

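Plugin and Driver Preparation

The model requires the following plugins and drivers:

| Component | Version | Environment setup guide |
| --- | --- | --- |
| Firmware and driver | 25.0.RC1 | PyTorch framework inference environment preparation |
| CANN | 8.2.RC1 | Includes the kernels and toolkit packages |
| Python | 3.11 | - |
| PyTorch | 2.5.1 | - |
| Ascend Extension PyTorch | 2.5.1 | - |

Note: Supported hardware is Atlas 800I A2/Atlas 300I A2 and Atlas 300I DUO/Atlas 300I Pro.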

Get the Repository Source

git clone https://gitcode.com/ascend/ModelZoo-PyTorch.git
cd ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/whisper/whisper_torchair

Environment Setup

Download and install (or upgrade to) the latest version of Whisper with the following command:

pip3 install -U openai-whisper
Download the model weights:
```
mkdir weight
cd weight
```
Whisper weights in .pt format:
- base.pt: download link
- large-v3.pt: download link
- large-v3-turbo.pt: download link

Whisper weights in .safetensors format:
- whisper-base safetensors: download link
- whisper-large-v3 safetensors: download link
- whisper-large-v3-turbo safetensors: download link

VAD weights:
- speech_fsmn_vad_zh-cn-16k-common-pytorch: download link
```
cd ..
```
Weight conversion (safetensors to .pt format):

Skip this step if you downloaded the weights in .pt format.

python3 weight_converter.py --model_name large-v3 --model_path ./weight/whisper-large-v3 # valid values for model_name are large-v3, large-v3-turbo, and base; adjust model_path to your setup

Install the ffmpeg command-line tool:
- On Ubuntu or Debian: sudo apt update && sudo apt install ffmpeg
- On Arch Linux: sudo pacman -S ffmpeg

Install the Python dependencies: pip3 install -r requirements.txt

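Dataset Preparation

LibriSpeech/dev-clean dataset: download link
audio.mp3 is an ordinary speech file for a quick hands-on test and can be obtained from the link below. (You can also place your own Chinese speech file in .mp3 or .wav format in the directory.)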
```
https://pan.baidu.com/s/1Yvln3t88XbOR5bfDPdLByg?pwd=i3x8 (extraction code: i3x8)
```

Directory Structure

The directory layout is roughly as follows:

📁 whisper_torchair/
├── check_numa.sh
├── audio.mp3
├── infer.py
├── modeling_whisper.py
├── pipeline.py
├── test_performance.py
├── transcribe.py
├── 📁 LibriSpeech/
├── 📁 patches/
│   ├── 📄 patch_apply.py
│   ├── 📄 kaldi.patch
│   ├── 📄 vad_model.patch
│   └── 📄 wav_frontend.patch
├── README.md
├── requirements.txt
├── run_wer_test.py
├── weight_converter.py
└── 📁 weight/
    ├── 📁 Whisper-large-v3/
    │   └── 📄 large-v3.pt
    └── 📁 speech_fsmn_vad_zh-cn-16k-common-pytorch/

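Model Inference

Script overview:

- infer.py: transcribes short audio (<30 s) and verifies performance on the LibriSpeech dataset
- transcribe.py: transcribes long audio, for example generating subtitles in a smart-classroom scenario

Set the environment variables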

source /usr/local/Ascend/ascend-toolkit/set_env.sh  # adjust the path to match your installation
# Performance-related environment variables
export TASK_QUEUE_ENABLE=1
export PYTORCH_NPU_ALLOC_CONF='expandable_segments:True'

Select the NPU ID to use (default: 0)
```
export ASCEND_RT_VISIBLE_DEVICES=0
```

Apply the patches to funasr and torchaudio

cd patches
python3 patch_apply.py
cd ..

Enable core binding to further improve performance

export CPU_AFFINITY_CONF=1
apt-get update
apt-get install numactl
# Run the script outside the container to see the NUMA node and CPUs that correspond to each NPU ID
bash check_numa.sh

Sample output:

...
>>>>设备 0 对应 NUMA 节点: 6, NUMA node6 CPU(s):     192-223
...

Short-audio inference demo. Set the CPU range according to the cores reported for your device, for example:
```
taskset -c 192-223 python3 infer.py --whisper_model_path ./weight/Whisper-large-v3/large-v3.pt
```
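infer.py arguments:
- --whisper_model_path: path to the Whisper model weights; default "./weight/Whisper-large-v3/large-v3.pt"
- --audio_path: path to the audio file; default "audio.mp3"
- --batch_size: batch size; default 1
- --warmup: number of warm-up runs; default 3. The graph is compiled during the first warm-up run

Long-audio transcription: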
```
taskset -c 192-223 python3 transcribe.py --whisper_model_path ./weight/Whisper-large-v3/large-v3.pt --audio_path {audio_file}
```
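transcribe.py arguments:
- --whisper_model_path: path to the Whisper model weights; default "./weight/Whisper-large-v3/large-v3.pt"
- --language: output language; default Chinese
- --sample_audio: audio used during the warm-up phase; default "audio.mp3"
- --audio_path: path to the long audio file; required
- --device: NPU device ID; default 0
- --warmup: number of warm-up runs; default 3. The graph is compiled during the first warm-up run

Performance test on the LibriSpeech dataset, using the whisperx inference flow: audio is first split by VAD and then regrouped into batches.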
```
taskset -c 192-223 python3 test_performance.py --whisper_model_path ./weight/Whisper-large-v3/large-v3.pt --vad_model_path ./weight/speech_fsmn_vad_zh-cn-16k-common-pytorch 
```
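test_performance.py arguments:
- --whisper_model_path: path to the Whisper model weights; default "./weight/Whisper-large-v3/large-v3.pt"
- --vad_model_path: path to the VAD model weights; default "./weight/speech_fsmn_vad_zh-cn-16k-common-pytorch"
- --audio_path: path to the audio file; default "audio.mp3"
- --librispeech_perf_test: when enabled, runs a performance test on a subset of the LibriSpeech dataset and prints the results and the transcription ratio; default True
- --skip_librispeech_perf_test: when passed, skips the LibriSpeech performance test
- --speech_path: path to the LibriSpeech dev-clean dataset; default "./LibriSpeech/dev-clean/"
- --num_audio_files: number of audio files selected from the LibriSpeech dev-clean dataset for the performance test; default 52. For the best performance, tune this number so that the segment count after VAD splitting and merging is close to, but not greater than, the batch size
- --librispeech_wer_demo: when enabled, transcribes one LibriSpeech audio sample and computes its WER; default True
- --skip_librispeech_wer_demo: when passed, skips the WER accuracy demo
- --device: NPU device ID; default 0
- --batch_size: batch size; default 16
- --warmup: number of warm-up runs; default 4. The graph is compiled during the first warm-up run

Accuracy test. Run the following command for a full accuracy test on the LibriSpeech dev-clean dataset: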
```
python3 run_wer_test.py --whisper_model_path ./weight/Whisper-large-v3/ 
```
run_wer_test.py arguments:
- --whisper_model_path: path to the Whisper model weights; default "./weight/Whisper-large-v3/large-v3.pt"
- --vad_model_path: path to the VAD model weights; default "./weight/speech_fsmn_vad_zh-cn-16k-common-pytorch"
- --speech_path: path to the LibriSpeech dev-clean dataset; default "./LibriSpeech/dev-clean/"
- --device: NPU device ID; default 0

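Performance Data

infer.py transcribes a subset of the audio in the LibriSpeech dev-clean dataset. The results are as follows:

| Model | Chip | Average transcription ratio |
| --- | --- | --- |
| whisper-base | 800I A2 32G | 400 |
| whisper-base | 300I DUO (single chip) | 014 |
| whisper-large-v3 | 800I A2 32G | 70 |
| whisper-large-v3 | 300I DUO (single chip) | 13 |
| whisper-large-v3-turbo | 800I A2 32G | 170 |
| whisper-large-v3-turbo | 300I DUO (single chip) | 37 |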
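
The guide does not define the transcription ratio. The sketch below assumes the common definition, seconds of audio transcribed per second of wall-clock time, so a higher value means faster inference. The function names are hypothetical and do not come from test_performance.py.

```python
import time
from typing import Callable, Sequence


def transcription_ratio(audio_seconds: float, elapsed_seconds: float) -> float:
    """Seconds of audio processed per second of wall-clock time (assumed definition)."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return audio_seconds / elapsed_seconds


def measure(run: Callable[[], None], audio_durations: Sequence[float]) -> float:
    """Time one call of run() and relate it to the total duration of the audio it handled."""
    start = time.perf_counter()
    run()
    elapsed = time.perf_counter() - start
    return transcription_ratio(sum(audio_durations), elapsed)


# 700 s of audio transcribed in 10 s of wall-clock time.
print(transcription_ratio(audio_seconds=700.0, elapsed_seconds=10.0))  # 70.0
```

Warm-up runs should be excluded from the timed region, because the first run includes graph compilation.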

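Accuracy Data

| Model | Chip | WER | Competitor WER |
| --- | --- | --- | --- |
| whisper-base | 800I A2 32G | 0.085 | 0.086 |
| whisper-base | 300I DUO (single chip) | 0.085 | 0.086 |
| whisper-large-v3 | 800I A2 32G | 0.049 | 0.051 |
| whisper-large-v3 | 300I DUO (single chip) | 0.049 | 0.051 |
| whisper-large-v3-turbo | 800I A2 32G | 0.050 | 0.051 |
| whisper-large-v3-turbo | 300I DUO (single chip) | 0.050 | 0.051 |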
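
For reference, WER (word error rate) is the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The following is a minimal sketch of that standard definition. The text normalization here (lowercasing and whitespace splitting) is a simplifying assumption, and run_wer_test.py may normalize text or compute the metric differently.

```python
from typing import List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """(substitutions + deletions + insertions) / number of reference words."""
    ref: List[str] = reference.lower().split()
    hyp: List[str] = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


# One substitution (sat -> sit) and one deletion (the) over six reference words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

A WER of 0.049 therefore corresponds to roughly five word errors per hundred reference words.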

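Note: The current adaptation supports the beam_size and best_of parameters only when the input audio batch size is 1. To support an input audio batch size greater than 1, comment out line 740 of Whisper's decoding.py: audio_features = audio_features[:: self.n_group]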