pub fn k3_ssd_chunk_state<B: Backend>(
    x_bnlhp: Tensor<B, 5>,
    b_bnlgr: Tensor<B, 5>,
    da_cumsum_bhnl: Tensor<B, 4>,
    dt_discretized_bhnl: Tensor<B, 4>,
) -> Tensor<B, 5>
Based on the Kernel 3 Triton reference _chunk_state_fwd_kernel (ssd_chunk_state.py).
Returns:
- cb_bngll used in K5 - state assuming zero initial state at each chunk boundary.
- b_bar_scale_bhnl [*] - intermediate value
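
A minimal scalar sketch of the chunk-state accumulation, assuming this function follows the Triton reference: for one (batch, chunk, head) slice, each position `l` contributes the outer product of `x[l]` and `b[l]`, scaled by `exp(da_cumsum[last] - da_cumsum[l]) * dt[l]`. The name `chunk_state_slice` and the plain `Vec` layout are hypothetical and used for illustration only; the real function operates on `Tensor<B, _>` values over all batches, chunks, and heads at once. The per-position scale may correspond to `b_bar_scale_bhnl` above, but that mapping is an assumption.

```rust
/// Hypothetical scalar reference for a single (batch, chunk, head) slice.
/// Not part of this crate; shapes are x: [l][p], b: [l][r],
/// da_cumsum: [l], dt: [l], and the result is state: [p][r].
pub fn chunk_state_slice(
    x: &[Vec<f32>],
    b: &[Vec<f32>],
    da_cumsum: &[f32],
    dt: &[f32],
) -> Vec<Vec<f32>> {
    let len = x.len();
    assert!(len > 0, "chunk must contain at least one position");
    assert!(b.len() == len && da_cumsum.len() == len && dt.len() == len);

    let p_dim = x[0].len();
    let r_dim = b[0].len();
    // Cumulative decay at the last position of the chunk.
    let da_last = da_cumsum[len - 1];

    // Zero initial state at the chunk boundary.
    let mut state = vec![vec![0.0f32; r_dim]; p_dim];
    for l in 0..len {
        // Decay from position l to the end of the chunk, times the step size.
        let scale = (da_last - da_cumsum[l]).exp() * dt[l];
        for p in 0..p_dim {
            for r in 0..r_dim {
                state[p][r] += x[l][p] * b[l][r] * scale;
            }
        }
    }
    state
}
```

With `da_cumsum` all zeros the decay factor is 1, so the result reduces to a `dt`-weighted sum of outer products, which makes a convenient sanity check against the tensor implementation.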