Module quantization
Expand description
Tensor quantization module.
Structs§
- Block
Size - Copyable block size, specialized version of
SmallVec. - Calibration
Range - The observed input calibration range.
- QParam
Tensor - A quantization parameter tensor descriptor.
- QParams
- The quantization tensor data parameters.
- Quant
Scheme - Describes a quantization scheme/configuration.
- Quantization
Parameters Primitive - The quantization parameters primitive.
- Quantized
Bytes - Quantized data bytes representation.
- Symmetric
Quantization - Symmetric quantization scheme.
Enums§
- Calibration
- Calibration method used to compute the quantization range mapping.
- Quant
Acc - The precision of accumulating elements.
- Quant
Level - Level or granularity of quantization.
- Quant
Mode - Strategy used to quantize values.
- Quant
Param - Quantization floating-point precision.
- Quant
Propagation - Specify if the output of an operation is quantized using the scheme of the input or returned unquantized.
- Quant
Store - Data type used to stored quantized values.
- Quant
Value - Data type used to represent quantized values.
- Quantization
Strategy - Quantization strategy.
Traits§
- QTensor
Primitive - Quantized tensor primitive.
- Quantization
- Quantization scheme to convert elements of a higher precision data type
Eto a lower precision data typeQand vice-versa.
Functions§
- compute_
q_ params - Compute the quantization parameters.
- compute_
range - Compute the quantization range mapping.
- pack_
i8s_ to_ u32s - Pack signed 8-bit integer values into a sequence of unsigned 32-bit integers.
- params_
shape - Calculate the shape of the quantization parameters for a given tensor and level
Type Aliases§
- Quantization
Parameters - The tensor quantization parameters.