热点
"长序列建模" 相关文章
MemMamba: Rethinking Memory Patterns in State Space Model
cs.AI updates on arXiv.org 2025-10-07T04:13:29.000000Z
MemMamba: Rethinking Memory Patterns in State Space Model
cs.AI updates on arXiv.org 2025-10-07T04:13:29.000000Z
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
cs.AI updates on arXiv.org 2025-10-02T04:18:31.000000Z
Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling
cs.AI updates on arXiv.org 2025-09-23T05:57:44.000000Z
Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts
cs.AI updates on arXiv.org 2025-09-16T05:06:56.000000Z
Exploring Synaptic Resonance in Large Language Models: A Novel Approach to Contextual Memory Integration
cs.AI updates on arXiv.org 2025-08-11T04:08:47.000000Z
算力终结者来了!华人天团「降维打击」注意力瓶颈,AI狂飙进对数时代
智源社区 2025-06-09T16:38:01.000000Z
盖过马斯克Grok3锋芒!DeepSeek又放大招:基于硬件对齐的 NSA, 可直接端到端训练
一支烟花AI 2025-02-19T23:29:38.000000Z
ReMamba: Enhancing Long-Sequence Modeling with a 3.2-Point Boost on LongBench and 1.6-Point Improvement on L-Eval Benchmarks
MarkTechPost@AI 2024-09-03T05:35:08.000000Z
ReMamba: Enhancing Long-Sequence Modeling with a 3.2-Point Boost on LongBench and 1.6-Point Improvement on L-Eval Benchmarks
MarkTechPost@AI 2024-09-03T05:35:08.000000Z
ReMamba: Enhancing Long-Sequence Modeling with a 3.2-Point Boost on LongBench and 1.6-Point Improvement on L-Eval Benchmarks
MarkTechPost@AI 2024-09-03T05:35:08.000000Z
ReMamba: Enhancing Long-Sequence Modeling with a 3.2-Point Boost on LongBench and 1.6-Point Improvement on L-Eval Benchmarks
MarkTechPost@AI 2024-09-03T05:35:08.000000Z
ReMamba: Enhancing Long-Sequence Modeling with a 3.2-Point Boost on LongBench and 1.6-Point Improvement on L-Eval Benchmarks
MarkTechPost@AI 2024-09-03T05:35:08.000000Z