OH_NN_QuantParam
typedef struct OH_NN_QuantParam {...} OH_NN_QuantParam
Overview
Defines the quantization information.
In quantization scenarios, the 32-bit floating-point data type is quantized into the fixed-point data type according to the following formula:

where, s and z are quantization parameters, which are stored by scale and zeroPoint in OH_NN_QuantParam. r is a floating point number, q is the quantization result, q_min is the lower bound of the quantization result, and q_max is the upper bound of the quantization result. The calculation method is as follows:


The clamp function is defined as follows:

Since: 9
Deprecated from: 11
Substitute: NN_QuantParam
Related module: NeuralNetworkRuntime
Header file: neural_network_runtime_type.h
Summary
Member Variables
| Name | Description |
|---|---|
| uint32_t quantCount | Length of the numBits, scale, and zeroPoint arrays. In the per-layer quantization scenario, quantCount is usually set to 1. That is, all channels of a tensor share a set of quantization parameters. In the per-channel quantization scenario, quantCount is usually the same as the number of tensor channels, and each channel uses its own quantization parameters. |
| const uint32_t *numBits | Number of quantization bits. |
| const double *scale | Pointer to the scale data in the quantization formula. |
| const int32_t *zeroPoint | Pointer to the zero point data in the quantization formula. |