CTCLoss

Struct CTCLoss 

pub struct CTCLoss { /* private fields */ }
Expand description

Computes the Connectionist Temporal Classification (CTC) loss.

Calculates the loss between a continuous (unsegmented) time series and a target sequence. CTC sums over the probability of all possible alignments of the input to the target, producing a loss value that is differentiable with respect to each input node.

The input to this loss is expected to be log-probabilities (e.g,, via log_softmax), not raw logits.

§References

§Example

use burn::tensor::{Tensor, Int};
use burn::tensor::activation::log_softmax;
use burn::nn::loss::{CTCLossConfig, CTCLoss};

let device = Default::default();

// Initialize CTC Loss with default configuration
let ctc_loss = CTCLossConfig::new().init();

// Initialize CTC Loss with custom configuration
let ctc_loss = CTCLossConfig::new()
    .with_blank(1)
    .with_zero_infinity(true)
    .init();

// Prepare inputs (Logits shape: [Time, Batch, Class])
// In your actual code, the logits would be the output of your model
let logits = Tensor::<B, 3>::ones([10, 2, 5], &device);
let log_probs = log_softmax(logits, 2);

// Targets shape: [Batch, Max_Target_Len]
// Note: Targets should not contain the blank index (1).
let targets = Tensor::<B, 2, Int>::from_data([[0, 2], [3, 4]], &device);

// Lengths shape: [Batch]
let input_lengths = Tensor::<B, 1, Int>::from_data([10, 8], &device);
let target_lengths = Tensor::<B, 1, Int>::from_data([2, 2], &device);

// Compute loss
let loss = ctc_loss.forward(log_probs, targets, input_lengths, target_lengths);

Implementations§

§

impl CTCLoss

pub fn forward<B>( &self, log_probs: Tensor<B, 3>, targets: Tensor<B, 2, Int>, input_lengths: Tensor<B, 1, Int>, target_lengths: Tensor<B, 1, Int>, ) -> Tensor<B, 1>
where B: Backend,

Computes the CTC loss for the input log-probabilities and targets with no reduction applied.

§Arguments
  • log_probs: The log-probabilities of the outputs (e.g., from log_softmax).
  • targets: A 2D tensor containing the target class indices. These indices should not include the blank index used in CTC loss. The targets are padded to the length of the longest sequence.
  • input_lengths: A 1D tensor containing the actual length of the input sequence for each batch. This allows retrieving the actual sequence of log-probabilities from log_probs if the batch contains sequences of varying lengths.
  • target_lengths: A 1D tensor containing the actual length of the target sequence for each target sequence in targets.
§Returns
  • A 1D tensor of shape [batch_size] containing the loss for each sample.
§Shapes
  • log_probs: [time_steps, batch_size, num_classes] where num_classes includes blank.
  • targets: [batch_size, max_target_length]
  • input_lengths: [batch_size]
  • target_lengths: [batch_size]

pub fn forward_with_reduction<B>( &self, log_probs: Tensor<B, 3>, targets: Tensor<B, 2, Int>, input_lengths: Tensor<B, 1, Int>, target_lengths: Tensor<B, 1, Int>, reduction: Reduction, ) -> Tensor<B, 1>
where B: Backend,

Computes the CTC loss for the input log-probabilities and targets with reduction.

§Arguments
  • log_probs: The log-probabilities of the outputs (e.g., from log_softmax).
  • targets: A 2D tensor containing the target class indices. These indices should not include the blank index used in CTC loss. The targets are padded to the length of the longest sequence.
  • input_lengths: A 1D tensor containing the actual length of the input sequence for each batch. This allows retrieving the actual sequence of log-probabilities from log_probs if the batch contains sequences of varying lengths.
  • target_lengths: A 1D tensor containing the actual length of the target sequence for each target sequence in targets.
  • reduction: The reduction stratey to apply to the loss tensor containing the CTC loss values for each sample (e.g., mean, sum). For the mean reduction strategy, the output losses will be divided by the target lengths and then the mean over the batch is taken. This follows PyTorch’s behavior.
§Returns
  • A 1D tensor of shape [1] containing the reduced loss value.
§Shapes
  • log_probs: [time_steps, batch_size, num_classes] where num_classes includes blank.
  • targets: [batch_size, max_target_length]
  • input_lengths: [batch_size]
  • target_lengths: [batch_size]
§Panics
  • If reduction is not one of Reduction::Auto, Reduction::Mean, and Reduction::Sum.
  • If blank index is greater than or equal to num_classes.
  • If the batch dimension of log_probs, targets, input_lengths, and target_lengths do not match.

Trait Implementations§

§

impl<B> AutodiffModule<B> for CTCLoss
where B: AutodiffBackend,

§

type InnerModule = CTCLoss

Inner module without auto-differentiation.
§

fn valid(&self) -> <CTCLoss as AutodiffModule<B>>::InnerModule

Returns the same module, but on the inner backend without auto-differentiation.
§

fn from_inner(module: <CTCLoss as AutodiffModule<B>>::InnerModule) -> CTCLoss

Wraps an inner module back into an auto-diff module.
§

impl Clone for CTCLoss

§

fn clone(&self) -> CTCLoss

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
§

impl Debug for CTCLoss

§

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

Formats the value using the given formatter. Read more
§

impl Display for CTCLoss

§

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

Formats the value using the given formatter. Read more
§

impl<B> Module<B> for CTCLoss
where B: Backend,

§

type Record = EmptyRecord

Type to save and load the module.
§

fn visit<V>(&self, _visitor: &mut V)
where V: ModuleVisitor<B>,

Visit each tensor parameter in the module with a visitor.
§

fn map<M>(self, _mapper: &mut M) -> CTCLoss
where M: ModuleMapper<B>,

Map each tensor parameter in the module with a mapper.
§

fn load_record(self, _record: <CTCLoss as Module<B>>::Record) -> CTCLoss

Load the module state from a record.
§

fn into_record(self) -> <CTCLoss as Module<B>>::Record

Convert the module into a record containing the state.
§

fn to_device(self, _: &<B as BackendTypes>::Device) -> CTCLoss

Move the module and all of its sub-modules to the given device. Read more
§

fn fork(self, _: &<B as BackendTypes>::Device) -> CTCLoss

Fork the module and all of its sub-modules to the given device. Read more
§

fn collect_devices( &self, devices: Vec<<B as BackendTypes>::Device>, ) -> Vec<<B as BackendTypes>::Device>

Return all the devices found in the underneath module tree added to the given vector without duplicates.
§

fn devices(&self) -> Vec<<B as BackendTypes>::Device>

Return all the devices found in the underneath module tree without duplicates.
§

fn no_grad(self) -> Self

Each tensor in the module tree will not require grad. Read more
§

fn train<AB>(self) -> Self::TrainModule
where AB: AutodiffBackend<InnerBackend = B>, Self: HasAutodiffModule<AB>,

Move the module and all of its sub-modules to the autodiff backend. Read more
§

fn num_params(&self) -> usize

Get the number of parameters the module has, including all of its sub-modules.
§

fn save_file<FR, PB>( self, file_path: PB, recorder: &FR, ) -> Result<(), RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Save the module to a file using the provided file recorder. Read more
§

fn load_file<FR, PB>( self, file_path: PB, recorder: &FR, device: &<B as BackendTypes>::Device, ) -> Result<Self, RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Load the module from a file using the provided file recorder. Read more
§

fn quantize_weights(self, quantizer: &mut Quantizer) -> Self

Quantize the weights of the module.
§

impl ModuleDisplay for CTCLoss

§

fn format(&self, passed_settings: DisplaySettings) -> String

Formats the module with provided display settings. Read more
§

fn custom_settings(&self) -> Option<DisplaySettings>

Custom display settings for the module. Read more
§

fn custom_content(&self, _content: Content) -> Option<Content>

Custom attributes for the module. Read more
§

impl ModuleDisplayDefault for CTCLoss

§

fn content(&self, content: Content) -> Option<Content>

Attributes of the module used for display purposes. Read more
§

fn num_params(&self) -> usize

Gets the number of the parameters of the module.

Auto Trait Implementations§

Blanket Implementations§

§

impl<T> Adaptor<()> for T

§

fn adapt(&self)

Adapt the type to be passed to a metric.
Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
§

impl<C> CloneExpand for C
where C: Clone,

§

fn __expand_clone_method(&self, _scope: &mut Scope) -> C

Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
§

impl<T> Downcast<T> for T

§

fn downcast(&self) -> &T

Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

§

impl<T> IntoComptime for T

§

fn comptime(self) -> Self

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
§

impl<T> Pointable for T

§

const ALIGN: usize

The alignment of pointer.
§

type Init = T

The type for initializers.
§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
§

impl<T> ToCompactString for T
where T: Display,

§

fn try_to_compact_string(&self) -> Result<CompactString, ToCompactStringError>

Fallible version of [ToCompactString::to_compact_string()] Read more
§

fn to_compact_string(&self) -> CompactString

Converts the given value to a [CompactString]. Read more
§

impl<T> ToLine for T
where T: Display,

§

fn to_line(&self) -> Line<'_>

Converts the value to a [Line].
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
§

impl<T> ToSpan for T
where T: Display,

§

fn to_span(&self) -> Span<'_>

Converts the value to a [Span].
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
§

impl<T> ToText for T
where T: Display,

§

fn to_text(&self) -> Text<'_>

Converts the value to a [Text].
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<T> TuneInputs for T
where T: Clone + Send + Sync + 'static,

§

type At<'a> = T

The concrete input type at lifetime 'a.
§

impl<T> Upcast<T> for T

§

fn upcast(&self) -> Option<&T>

§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

impl<T> WasmNotSend for T
where T: Send,

§

impl<T> WasmNotSendSync for T
where T: WasmNotSend + WasmNotSync,

§

impl<T> WasmNotSync for T
where T: Sync,