Module quantization
Expand description
Tensor quantization module.
Structs§
- Calibration
Range - The observed input calibration range.
- QParams
- The quantization tensor data parameters.
- Quantization
Parameters Primitive - The quantization parameters primitive.
- Quantized
Bytes - Quantized data bytes representation.
- Symmetric
Quantization - Symmetric quantization scheme.
Enums§
- Calibration
- Calibration method used to compute the quantization range mapping.
- Quantization
Mode - Quantization mode.
- Quantization
Scheme - Quantization scheme.
- Quantization
Strategy - Quantization strategy.
- Quantization
Type - Quantization data type.
- Quantization
Type Expand
Traits§
- QTensor
Primitive - Quantized tensor primitive.
- Quantization
- Quantization scheme to convert elements of a higher precision data type
E
to a lower precision data typeQ
and vice-versa.
Functions§
- pack_
i8s_ to_ u32s - Pack signed 8-bit integer values into a sequence of unsigned 32-bit integers.
- unpack_
u32s_ to_ i8s - Unpack 32-bit unsigned integer values into a sequence of signed 8-bit integers.
Type Aliases§
- Quantization
Parameters - The tensor quantization parameters.