Skip to main content

Mamba2Cache

Struct Mamba2Cache 

Source
pub struct Mamba2Cache<B: Backend> {
    pub conv_bvk: Tensor<B, 3>,
    pub ssm_bhpr: Tensor<B, 4>,
}
Expand description

The mutable state carried between decoding steps for a single Mamba-2 layer.

Both tensors are updated in-place (via Burn’s functional clone) at every call to [crate::mamba2::Mamba2::step].

Fields§

§conv_bvk: Tensor<B, 3>

Convolution rolling window.

Stores the last conv_kernel pre-activation feature vectors fed into the depthwise Conv1d. At each step, the oldest column is discarded and the new token’s projection is appended (a left-shift followed by an insert into the rightmost column), maintaining strict causality.

Shape: [batch, conv_dim, conv_kernel]

  • conv_dim = d_inner + 2 · ngroups · state_rank
  • conv_kernel is typically 4
§ssm_bhpr: Tensor<B, 4>

SSM hidden state hₜ.

This is the O(P·N) compressed summary of all tokens seen so far. Updated via hₜ = Āₜ hₜ₋₁ + B̄ₜ xₜ at each decoding step.

The tensor is indexed as [batch, nheads, per_head_dim, state_rank] (i.e. [B, H, P, N] in the paper’s notation), which is the transpose of the mathematical hₜ ∈ ℝ^{N×P} but equivalent in content.

Shape: [batch, nheads, per_head_dim, state_rank]

Implementations§

Source§

impl<B: Backend> Mamba2Cache<B>

Source

pub fn sanity(&self)

Trait Implementations§

Source§

impl<B> AutodiffModule<B> for Mamba2Cache<B>
where B: AutodiffBackend + Backend, <B as AutodiffBackend>::InnerBackend: Backend,

Source§

type InnerModule = Mamba2Cache<<B as AutodiffBackend>::InnerBackend>

Inner module without auto-differentiation.
Source§

fn valid(&self) -> Self::InnerModule

Returns the same module, but on the inner backend without auto-differentiation.
Source§

fn from_inner(module: Self::InnerModule) -> Self

Wraps an inner module back into an auto-diff module.
Source§

impl<B: Backend> Clone for Mamba2Cache<B>

Source§

fn clone(&self) -> Self

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl<B: Debug + Backend> Debug for Mamba2Cache<B>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<B: Backend> Display for Mamba2Cache<B>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<B> HasAutodiffModule<B> for Mamba2Cache<B::InnerBackend>
where B: AutodiffBackend + Backend, <B as AutodiffBackend>::InnerBackend: Backend,

Source§

type TrainModule = Mamba2Cache<B>

The module with auto-differentiation.
Source§

impl<B: Backend> Module<B> for Mamba2Cache<B>

Source§

type Record = Mamba2CacheRecord<B>

Type to save and load the module.
Source§

fn load_record(self, record: Self::Record) -> Self

Load the module state from a record.
Source§

fn into_record(self) -> Self::Record

Convert the module into a record containing the state.
Source§

fn num_params(&self) -> usize

Get the number of parameters the module has, including all of its sub-modules.
Source§

fn visit<Visitor: ModuleVisitor<B>>(&self, visitor: &mut Visitor)

Visit each tensor parameter in the module with a visitor.
Source§

fn map<Mapper: ModuleMapper<B>>(self, mapper: &mut Mapper) -> Self

Map each tensor parameter in the module with a mapper.
Source§

fn collect_devices(&self, devices: Devices<B>) -> Devices<B>

Return all the devices found in the underneath module tree added to the given vector without duplicates.
Source§

fn to_device(self, device: &B::Device) -> Self

Move the module and all of its sub-modules to the given device. Read more
Source§

fn fork(self, device: &B::Device) -> Self

Fork the module and all of its sub-modules to the given device. Read more
§

fn devices(&self) -> Vec<<B as BackendTypes>::Device>

Return all the devices found in the underneath module tree without duplicates.
§

fn no_grad(self) -> Self

Each tensor in the module tree will not require grad. Read more
§

fn train<AB>(self) -> Self::TrainModule
where AB: AutodiffBackend<InnerBackend = B>, Self: HasAutodiffModule<AB>,

Move the module and all of its sub-modules to the autodiff backend. Read more
§

fn quantize_weights(self, quantizer: &mut Quantizer) -> Self

Quantize the weights of the module.
Source§

impl<B: Backend> ModuleDisplay for Mamba2Cache<B>

§

fn format(&self, passed_settings: DisplaySettings) -> String

Formats the module with provided display settings. Read more
§

fn custom_settings(&self) -> Option<DisplaySettings>

Custom display settings for the module. Read more
§

fn custom_content(&self, _content: Content) -> Option<Content>

Custom attributes for the module. Read more
Source§

impl<B: Backend> ModuleDisplayDefault for Mamba2Cache<B>

Source§

fn content(&self, content: Content) -> Option<Content>

Attributes of the module used for display purposes. Read more
Source§

fn num_params(&self) -> usize

Gets the number of the parameters of the module.

Auto Trait Implementations§

§

impl<B> Freeze for Mamba2Cache<B>
where <B as BackendTypes>::FloatTensorPrimitive: Freeze, <B as BackendTypes>::QuantizedTensorPrimitive: Freeze,

§

impl<B> RefUnwindSafe for Mamba2Cache<B>
where <B as BackendTypes>::FloatTensorPrimitive: RefUnwindSafe, <B as BackendTypes>::QuantizedTensorPrimitive: RefUnwindSafe,

§

impl<B> Send for Mamba2Cache<B>

§

impl<B> Sync for Mamba2Cache<B>

§

impl<B> Unpin for Mamba2Cache<B>
where <B as BackendTypes>::FloatTensorPrimitive: Unpin, <B as BackendTypes>::QuantizedTensorPrimitive: Unpin,

§

impl<B> UnsafeUnpin for Mamba2Cache<B>
where <B as BackendTypes>::FloatTensorPrimitive: UnsafeUnpin, <B as BackendTypes>::QuantizedTensorPrimitive: UnsafeUnpin,

§

impl<B> UnwindSafe for Mamba2Cache<B>
where <B as BackendTypes>::FloatTensorPrimitive: UnwindSafe, <B as BackendTypes>::QuantizedTensorPrimitive: UnwindSafe,

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.