Enum AdjustLrFn
pub enum AdjustLrFn {
Original,
MatchRmsAdamW,
}Expand description
Learning rate adjustment method for Muon optimizer.
Muon adjusts the learning rate based on parameter shape to maintain consistent RMS across rectangular matrices.
§References
- Original: Muon: An optimizer for hidden layers
- Moonshot: Muon is Scalable for LLM Training
Variants§
Original
Keller Jordan’s original method: lr * sqrt(max(1, A/B))
This scales the learning rate based on the aspect ratio of the weight matrix, ensuring that tall matrices (more rows than columns) get proportionally larger learning rates.
§Example
For a [1024, 512] matrix: lr * sqrt(1024/512) = lr * 1.414
MatchRmsAdamW
Moonshot’s method: lr * 0.2 * sqrt(max(A, B))
This method is designed to match AdamW’s RMS, allowing Muon to directly reuse learning rates and weight decay values tuned for AdamW without retuning.
§Example
For a [1024, 512] matrix: lr * 0.2 * sqrt(1024) = lr * 6.4
Trait Implementations§
§impl Clone for AdjustLrFn
impl Clone for AdjustLrFn
§fn clone(&self) -> AdjustLrFn
fn clone(&self) -> AdjustLrFn
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read more§impl Debug for AdjustLrFn
impl Debug for AdjustLrFn
§impl Default for AdjustLrFn
impl Default for AdjustLrFn
§fn default() -> AdjustLrFn
fn default() -> AdjustLrFn
Returns the “default value” for a type. Read more
§impl<'de> Deserialize<'de> for AdjustLrFn
impl<'de> Deserialize<'de> for AdjustLrFn
§fn deserialize<__D>(
__deserializer: __D,
) -> Result<AdjustLrFn, <__D as Deserializer<'de>>::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(
__deserializer: __D,
) -> Result<AdjustLrFn, <__D as Deserializer<'de>>::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
§impl PartialEq for AdjustLrFn
impl PartialEq for AdjustLrFn
§impl Serialize for AdjustLrFn
impl Serialize for AdjustLrFn
§fn serialize<__S>(
&self,
__serializer: __S,
) -> Result<<__S as Serializer>::Ok, <__S as Serializer>::Error>where
__S: Serializer,
fn serialize<__S>(
&self,
__serializer: __S,
) -> Result<<__S as Serializer>::Ok, <__S as Serializer>::Error>where
__S: Serializer,
Serialize this value into the given Serde serializer. Read more
impl Copy for AdjustLrFn
impl Eq for AdjustLrFn
impl StructuralPartialEq for AdjustLrFn
Auto Trait Implementations§
impl Freeze for AdjustLrFn
impl RefUnwindSafe for AdjustLrFn
impl Send for AdjustLrFn
impl Sync for AdjustLrFn
impl Unpin for AdjustLrFn
impl UnwindSafe for AdjustLrFn
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Checks if this value is equivalent to the given key. Read more
§impl<T> Instrument for T
impl<T> Instrument for T
§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more