Struct QuantScheme

pub struct QuantScheme {
    pub value: QuantValue,
    pub param: QuantParam,
    pub store: QuantStore,
    pub level: QuantLevel,
    pub mode: QuantMode,
}

Describes a quantization scheme/configuration.

Fields§

§value: QuantValue

The logical data type of quantized input values (e.g., QInt8).

This defines how values are interpreted during computation, independent of how they’re stored.

§param: QuantParam

Precision used for quantization parameters (e.g., scale and biases).

§store: QuantStore

Data type used for storing quantized values.

§level: QuantLevel

Granularity level of quantization (e.g., per-tensor).

§mode: QuantMode

Quantization mode (e.g., symmetric).

Implementations§

§

impl QuantScheme

pub fn with_level(self, level: QuantLevel) -> QuantScheme

Set the quantization level.

pub fn with_mode(self, mode: QuantMode) -> QuantScheme

Set the quantization mode.

pub fn with_value(self, value: QuantValue) -> QuantScheme

Set the data type used for quantized values.

pub fn with_store(self, store: QuantStore) -> QuantScheme

Set the data type used to store quantized values.

pub fn with_param(self, param: QuantParam) -> QuantScheme

Set the precision used for quantization parameters.
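
The with_* methods above take self by value and return the updated scheme, so they can be chained. The following is a minimal sketch of that pattern using stand-in types: Scheme, Level, Mode, and block_scheme are hypothetical names defined only for this illustration, and the variants shown are assumptions rather than the crate's actual definitions.

```rust
// Stand-in types for illustration only; they are NOT the crate's definitions.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Level {
    Tensor,
    Block(usize),
}

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Mode {
    Symmetric,
}

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Scheme {
    pub level: Level,
    pub mode: Mode,
}

impl Scheme {
    /// Consumes `self` and returns it with `level` replaced.
    pub fn with_level(mut self, level: Level) -> Self {
        self.level = level;
        self
    }

    /// Consumes `self` and returns it with `mode` replaced.
    pub fn with_mode(mut self, mode: Mode) -> Self {
        self.mode = mode;
        self
    }
}

/// Derives a block-level scheme from `base` by chaining two setters.
pub fn block_scheme(base: Scheme, block_size: usize) -> Scheme {
    base.with_level(Level::Block(block_size))
        .with_mode(Mode::Symmetric)
}
```

Because the stand-in Scheme is Copy (as QuantScheme is), passing base by value leaves the caller's copy unchanged.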

pub fn size_bits_stored(&self) -> usize

Returns the size of the quantization storage type in bits.

pub fn size_bits_value(&self) -> usize

Returns the size of the quantized value type in bits.

pub fn num_quants(&self) -> usize

Returns the number of quantized values stored in a single element.
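
As a worked illustration of how the two sizes relate to the number of packed values, here is a minimal sketch. It assumes values are packed back to back with no padding, so the count is the stored size divided by the value size. The free functions num_quants, pack_u4x8, and unpack_u4x8 are hypothetical helpers written for this example, and the bit order (lowest index in the lowest bits) is an assumption, not a statement about the crate's layout.

```rust
/// Number of quantized values that fit in one storage element,
/// assuming back-to-back packing with no padding bits.
pub fn num_quants(size_bits_stored: usize, size_bits_value: usize) -> usize {
    size_bits_stored / size_bits_value
}

/// Packs eight 4-bit values into one `u32`.
/// Assumed layout: value `i` occupies bits `4*i .. 4*i + 4`.
pub fn pack_u4x8(values: [u8; 8]) -> u32 {
    values
        .iter()
        .enumerate()
        .fold(0u32, |acc, (i, &v)| acc | (((v & 0x0F) as u32) << (4 * i)))
}

/// Inverse of `pack_u4x8` under the same assumed layout.
pub fn unpack_u4x8(packed: u32) -> [u8; 8] {
    let mut out = [0u8; 8];
    for (i, slot) in out.iter_mut().enumerate() {
        *slot = ((packed >> (4 * i)) & 0x0F) as u8;
    }
    out
}
```

Under these assumptions, a 32-bit store holding 4-bit values gives num_quants(32, 4) == 8, while an 8-bit store holding 8-bit values gives 1.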

pub fn native_packing(&self) -> usize

Returns the native packing factor for the values. When native packing > 1, the packed representation stores num_quants elements grouped into packs of native_packing size.

pub fn packing_dim(&self) -> Option<usize>

Returns the packing dimension for the store, if it has one.

pub fn swap_packing_dim(&mut self, dim0: usize, dim1: usize)

If the packing dim is either dim0 or dim1, swaps it to the other one. This mirrors the update made by shape.swap(dim0, dim1).
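
The behavior described above can be sketched as a pure function. This is an illustration under assumptions, not the crate's implementation: swapped_packing_dim is a hypothetical helper, and the packing dim is treated as an ordinary axis index counted the same way as dim0 and dim1.

```rust
/// Returns the packing dimension after the axes `dim0` and `dim1` are swapped.
/// A packing dim that is neither of the two (or is absent) is returned as is.
pub fn swapped_packing_dim(
    packing_dim: Option<usize>,
    dim0: usize,
    dim1: usize,
) -> Option<usize> {
    match packing_dim {
        Some(d) if d == dim0 => Some(dim1),
        Some(d) if d == dim1 => Some(dim0),
        other => other,
    }
}
```

For example, with a packing dim of 1, swapping axes 1 and 3 moves it to 3, while swapping axes 0 and 2 leaves it at 1.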

Trait Implementations§

§

impl Clone for QuantScheme

§

fn clone(&self) -> QuantScheme

Returns a duplicate of the value.

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

§

impl Debug for QuantScheme

§

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

Formats the value using the given formatter.

§

impl Default for QuantScheme

§

fn default() -> QuantScheme

Returns the “default value” for a type.

§

impl<'de> Deserialize<'de> for QuantScheme

§

fn deserialize<D>(deserializer: D) -> Result<QuantScheme, <D as Deserializer<'de>>::Error>
where D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.

§

impl Hash for QuantScheme

§

fn hash<H>(&self, state: &mut H)
where H: Hasher,

Feeds this value into the given Hasher.

1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher.

§

impl Ord for QuantScheme

§

fn cmp(&self, other: &QuantScheme) -> Ordering

This method returns an Ordering between self and other.

1.21.0 · Source§

fn max(self, other: Self) -> Self
where Self: Sized,

Compares and returns the maximum of two values.

1.21.0 · Source§

fn min(self, other: Self) -> Self
where Self: Sized,

Compares and returns the minimum of two values.

1.50.0 · Source§

fn clamp(self, min: Self, max: Self) -> Self
where Self: Sized,

Restrict a value to a certain interval.

§

impl PartialEq for QuantScheme

§

fn eq(&self, other: &QuantScheme) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

§

impl PartialOrd for QuantScheme

§

fn partial_cmp(&self, other: &QuantScheme) -> Option<Ordering>

This method returns an ordering between self and other values if one exists.

1.0.0 · Source§

fn lt(&self, other: &Rhs) -> bool

Tests less than (for self and other) and is used by the < operator.

1.0.0 · Source§

fn le(&self, other: &Rhs) -> bool

Tests less than or equal to (for self and other) and is used by the <= operator.

1.0.0 · Source§

fn gt(&self, other: &Rhs) -> bool

Tests greater than (for self and other) and is used by the > operator.

1.0.0 · Source§

fn ge(&self, other: &Rhs) -> bool

Tests greater than or equal to (for self and other) and is used by the >= operator.

§

impl Serialize for QuantScheme

§

fn serialize<S>(&self, serializer: S) -> Result<<S as Serializer>::Ok, <S as Serializer>::Error>
where S: Serializer,

Serialize this value into the given Serde serializer.

§

impl Copy for QuantScheme

§

impl Eq for QuantScheme

§

impl StructuralPartialEq for QuantScheme
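
Taken together, the Copy, Eq, and Hash implementations listed above mean a scheme can be used directly as a map key, for example to group tensors by configuration. The sketch below shows the idea with a stand-in type: SchemeKey and count_by_scheme are hypothetical names, and the two fields are placeholders rather than the real struct layout.

```rust
use std::collections::HashMap;

// Stand-in for a small, copyable configuration type; NOT the crate's struct.
#[derive(Clone, Copy, Debug, Default, PartialEq, Eq, PartialOrd, Ord, Hash)]
pub struct SchemeKey {
    pub value_bits: u8,
    pub store_bits: u8,
}

/// Counts how many times each distinct scheme occurs in `schemes`.
pub fn count_by_scheme(schemes: &[SchemeKey]) -> HashMap<SchemeKey, usize> {
    let mut counts = HashMap::new();
    for &scheme in schemes {
        // `scheme` is copied out of the slice, so it can be moved into the map.
        *counts.entry(scheme).or_insert(0) += 1;
    }
    counts
}
```

The Ord implementation would similarly allow the type to be stored in a BTreeMap or sorted in a Vec.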

Auto Trait Implementations§

§

impl UnwindSafe for QuantScheme

Blanket Implementations§

§

impl<T> Adaptor<()> for T

§

fn adapt(&self)

Adapt the type to be passed to a metric.

Source §

impl<T> Any for T
where T: 'static + ?Sized,

Source §

fn type_id(&self) -> TypeId

Gets the TypeId of self.

Source §

impl<T> Borrow<T> for T
where T: ?Sized,

Source §

fn borrow(&self) -> &T

Immutably borrows from an owned value.

Source §

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source §

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

Source §

impl<T> CloneToUninit for T
where T: Clone,

Source §

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

§

impl<Q, K> Comparable<K> for Q
where Q: Ord + ?Sized, K: Borrow<Q> + ?Sized,

§

fn compare(&self, key: &K) -> Ordering

Compare self to key and return their ordering.

§

impl<T> Downcast<T> for T

§

fn downcast(&self) -> &T

§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

§

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

§

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key.

Source §

impl<T> From<T> for T

Source §

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

Source §

impl<T, U> Into<U> for T
where U: From<T>,

Source §

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

§

impl<T> IntoComptime for T

§

fn comptime(self) -> Self

Source §

impl<T> IntoEither for T

Source §

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

Source §

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.

§