Module quantization
Expand description
Tensor quantization module.
Structs§
- Affine quantization scheme.
- The observed input calibration range.
- Computes the per-tensor quantization range mapping based on the min and max values.
- The quantization tensor data parameters.
- The quantization parameters primitive.
- Quantized data bytes representation.
- Symmetric quantization scheme.
Enums§
- Quantization scheme.
- Quantization strategy.
- Quantization data type.
Traits§
- Calibration method used to compute the quantization range mapping.
- Quantized tensor primitive.
- Quantization scheme to convert elements of a higher precision data type
E
to a lower precision data typeQ
and vice-versa.
Functions§
- Pack signed 8-bit integer values into a sequence of unsigned 32-bit integers.
- Unpack 32-bit unsigned integer values into a sequence of signed 8-bit integers.
Type Aliases§
- The tensor quantization parameters.