c0e2cd76创建于 2025年12月23日历史提交

YAMNet

Table of contents

1. Description

YAMNet is a deep net that predicts 521 audio event classes from the AudioSet-YouTube corpus it was trained on. It employs the Mobilenet_v1 depthwise-separable convolution architecture.

The model used in this example comes from the following open source projects:

https://www.tensorflow.org/hub/tutorials/yamnet

2. Current Support Platform

RK3562, RK3566, RK3568, RK3576, RK3588, RV1126B, RV1109, RV1126, RK1808, RK3399PRO

3. Pretrained Model

Download link:

yamnet_3s.onnx

Download with shell command:

cd model
./download_model.sh

Note: For exporting yamnet onnx models, please refer to export_onnx.md

4. Convert to RKNN

Usage:

cd python
python convert.py <onnx_model> <TARGET_PLATFORM> <dtype(optional)> <output_rknn_path(optional)>

# such as: 
python convert.py ../model/yamnet_3s.onnx rk3588
# output model will be saved as ../model/yamnet_3s.rknn

Description:

  • <onnx_model>: Specify ONNX model path.
  • <TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.
  • <dtype>(optional): Specify as i8/u8 or fp. i8/u8 for doing quantization, fp for no quantization. Default is fp.
  • <output_rknn_path>(optional): Specify save path for the RKNN model, default save in the same directory as ONNX model.

5. Python Demo

Usage:

cd python
# Inference with ONNX model
python yamnet.py --model_path <onnx_model> 

# Inference with RKNN model
python yamnet.py --model_path <rknn_model> --target <TARGET_PLATFORM>

Description:

  • <TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.

  • <onnx_model / rknn_model>: Specify model path.

6. Android Demo

Note: RK1808, RV1109, RV1126 does not support Android.

6.1 Compile and Build

Usage:

# go back to the rknn_model_zoo root directory
cd ../../
export ANDROID_NDK_PATH=<android_ndk_path>

./build-android.sh -t <TARGET_PLATFORM> -a <ARCH> -d yamnet

# such as 
./build-android.sh -t rk3588 -a arm64-v8a -d yamnet

Description:

  • <android_ndk_path>: Specify Android NDK path.
  • <TARGET_PLATFORM>: Specify NPU platform name. Support Platform refer here.
  • <ARCH>: Specify device system architecture. To query device architecture, refer to the following command:
    # Query architecture. For Android, ['arm64-v8a' or 'armeabi-v7a'] should shown in log.
    adb shell cat /proc/version
    

6.2 Push demo files to device

With device connected via USB port, push demo files to devices:

adb root
adb remount
adb push install/<TARGET_PLATFORM>_android_<ARCH>/rknn_yamnet_demo/ /data/

6.3 Run demo

adb shell
cd /data/rknn_yamnet_demo

export LD_LIBRARY_PATH=./lib
./rknn_yamnet_demo model/yamnet_3s.rknn model/test.wav

7. Linux Demo

Please note that the Linux compilation tool chain recommends using gcc-linaro-6.3.1(aarch64)/gcc-arm-8.3(armhf)/armhf-uclibcgnueabihf(armhf for RV1106/RV1103 series). Using other versions may encounter the problem of Cdemo compilation failure. For detailed compilation guide, please refer to Compilation_Environment_Setup_Guide.md

7.1 Compile and Build

usage

# go back to the rknn_model_zoo root directory
cd ../../

# if GCC_COMPILER not found while building, please set GCC_COMPILER path
(optional)export GCC_COMPILER=<GCC_COMPILER_PATH>

./build-linux.sh -t <TARGET_PLATFORM> -a <ARCH> -d yamnet

# such as 
./build-linux.sh -t rk3588 -a aarch64 -d yamnet

Description:

  • <GCC_COMPILER_PATH>: Specified as GCC_COMPILER path.

  • <TARGET_PLATFORM> : Specify NPU platform name. Support Platform refer here.

  • <ARCH>: Specify device system architecture. To query device architecture, refer to the following command:

    # Query architecture. For Linux, ['aarch64' or 'armhf'] should shown in log.
    adb shell cat /proc/version
    

7.2 Push demo files to device

  • If device connected via USB port, push demo files to devices:
adb push install/<TARGET_PLATFORM>_linux_<ARCH>/rknn_yamnet_demo/ /data/
  • For other boards, use scp or other approaches to push all files under install/<TARGET_PLATFORM>_linux_<ARCH>/rknn_yamnet_demo/ to data.

7.3 Run demo

adb shell
cd /data/rknn_yamnet_demo

export LD_LIBRARY_PATH=./lib
./rknn_yamnet_demo model/yamnet_3s.rknn model/test.wav

8. Expected Results

This example will print the predicted category of this sound, as follows:

The main sound is: Animal