Description
High-performance Kimi Delta Attention CUDA kernels built on CUTLASS for efficient recurrent state inference
Software Engineering / Diagnostics and Repair
flashkda-delta-attention
Security Audit