YAMNet
Table of contents
- 1. Description
- 2. Current Support Platform
- 3. Pretrained Model
- 4. Convert to RKNN
- 5. Python Demo
- 6. Android Demo
- 7. Linux Demo
- 8. Expected Results
1. Description
YAMNet is a deep net that predicts 521 audio event classes from the AudioSet-YouTube corpus it was trained on. It employs the Mobilenet_v1 depthwise-separable convolution architecture.
The model used in this example comes from the following open source projects:
https://www.tensorflow.org/hub/tutorials/yamnet
2. Current Support Platform
RK3562, RK3566, RK3568, RK3576, RK3588, RV1126B, RV1109, RV1126, RK1808, RK3399PRO
3. Pretrained Model
Download link:
Download with shell command:
cd model
./download_model.sh
Note: For exporting yamnet onnx models, please refer to export_onnx.md
4. Convert to RKNN
Usage:
cd python
python convert.py <onnx_model> <TARGET_PLATFORM> <dtype(optional)> <output_rknn_path(optional)>
# such as:
python convert.py ../model/yamnet_3s.onnx rk3588
# output model will be saved as ../model/yamnet_3s.rknn
Description:
<onnx_model>: Specify ONNX model path.<TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.<dtype>(optional): Specify asi8/u8orfp.i8/u8for doing quantization,fpfor no quantization. Default isfp.<output_rknn_path>(optional): Specify save path for the RKNN model, default save in the same directory as ONNX model.
5. Python Demo
Usage:
cd python
# Inference with ONNX model
python yamnet.py --model_path <onnx_model>
# Inference with RKNN model
python yamnet.py --model_path <rknn_model> --target <TARGET_PLATFORM>
Description:
-
<TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.
-
<onnx_model / rknn_model>: Specify model path.
6. Android Demo
Note: RK1808, RV1109, RV1126 does not support Android.
6.1 Compile and Build
Usage:
# go back to the rknn_model_zoo root directory
cd ../../
export ANDROID_NDK_PATH=<android_ndk_path>
./build-android.sh -t <TARGET_PLATFORM> -a <ARCH> -d yamnet
# such as
./build-android.sh -t rk3588 -a arm64-v8a -d yamnet
Description:
<android_ndk_path>: Specify Android NDK path.<TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.<ARCH>: Specify device system architecture. To query device architecture, refer to the following command:# Query architecture. For Android, ['arm64-v8a' or 'armeabi-v7a'] should shown in log. adb shell cat /proc/version
6.2 Push demo files to device
With device connected via USB port, push demo files to devices:
adb root
adb remount
adb push install/<TARGET_PLATFORM>_android_<ARCH>/rknn_yamnet_demo/ /data/
6.3 Run demo
adb shell
cd /data/rknn_yamnet_demo
export LD_LIBRARY_PATH=./lib
./rknn_yamnet_demo model/yamnet_3s.rknn model/test.wav
7. Linux Demo
Please note that the Linux compilation tool chain recommends using gcc-linaro-6.3.1(aarch64)/gcc-arm-8.3(armhf)/armhf-uclibcgnueabihf(armhf for RV1106/RV1103 series). Using other versions may encounter the problem of Cdemo compilation failure. For detailed compilation guide, please refer to Compilation_Environment_Setup_Guide.md
7.1 Compile and Build
usage
# go back to the rknn_model_zoo root directory
cd ../../
# if GCC_COMPILER not found while building, please set GCC_COMPILER path
(optional)export GCC_COMPILER=<GCC_COMPILER_PATH>
./build-linux.sh -t <TARGET_PLATFORM> -a <ARCH> -d yamnet
# such as
./build-linux.sh -t rk3588 -a aarch64 -d yamnet
Description:
-
<GCC_COMPILER_PATH>: Specified as GCC_COMPILER path. -
<TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here. -
<ARCH>: Specify device system architecture. To query device architecture, refer to the following command:# Query architecture. For Linux, ['aarch64' or 'armhf'] should shown in log. adb shell cat /proc/version
7.2 Push demo files to device
- If device connected via USB port, push demo files to devices:
adb push install/<TARGET_PLATFORM>_linux_<ARCH>/rknn_yamnet_demo/ /data/
- For other boards, use
scpor other approaches to push all files underinstall/<TARGET_PLATFORM>_linux_<ARCH>/rknn_yamnet_demo/todata.
7.3 Run demo
adb shell
cd /data/rknn_yamnet_demo
export LD_LIBRARY_PATH=./lib
./rknn_yamnet_demo model/yamnet_3s.rknn model/test.wav
8. Expected Results
This example will print the predicted category of this sound, as follows:
The main sound is: Animal