VITA-Group / SSM-Bottleneck
[ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li
☆17Mar 21, 2025Updated 10 months ago
Alternatives and similar repositories for SSM-Bottleneck
Users who are interested in SSM-Bottleneck are comparing it to the repositories listed below
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆13Dec 11, 2021Updated 4 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- This is the code for the paper "Transition Information Enhanced Disentangled Graph Neural Networks for Session-based Recommendation"☆14Apr 6, 2022Updated 3 years ago
- Blog post☆17Feb 16, 2024Updated last year
- ☆20Apr 17, 2023Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- [ECIR 2024] Official repository for the paper titled "Self Contrastive Learning for Session-based Recommendation"☆21Apr 3, 2024Updated last year
- ☆24Sep 25, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- Triton implementation of bi-directional (non-causal) linear attention☆65Feb 2, 2026Updated last week
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- This is the official code for SIGIR 2024 paper: 'Multi-intent-aware Session-based Recommendation'.☆24Mar 21, 2025Updated 10 months ago
- The official code for "A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction"☆29Sep 27, 2024Updated last year
- ☆29Jul 12, 2022Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- [AAAI'2025] The official implementation code of SIGMA☆39Oct 14, 2025Updated 3 months ago
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- This is the official code for WWW 2021 paper "Session-aware Linear Item-Item Models for Session-based Recommendation"☆33Sep 19, 2023Updated 2 years ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆13Nov 24, 2025Updated 2 months ago
- ☆16Feb 1, 2026Updated last week
- ☆13Jun 18, 2025Updated 7 months ago
- University of Chinese Academy of Sciences, Taiji Laboratory, 2025 "Undergraduate Innovation Practice Training Program"☆13Apr 1, 2025Updated 10 months ago
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling☆81Apr 24, 2024Updated last year
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- Official code for "STaSy: Score-based Tabular data Synthesis", ICLR 2023☆34Aug 11, 2023Updated 2 years ago
- This is the official code for the WSDM 2022 paper: 'S-Walk: Accurate and Scalable Session-based Recommendation with Random Walks'.☆33Sep 19, 2023Updated 2 years ago
- Hunan University course paper LaTeX template☆17Jul 14, 2024Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- Repository for unit 2 of the course INFO274: Simulation, Instituto de Informática, UACh☆10Dec 18, 2022Updated 3 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 7 years ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆14Nov 25, 2025Updated 2 months ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- Code for AAAI21 paper "Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning"☆11Feb 15, 2022Updated 3 years ago
- ☆12Jul 7, 2022Updated 3 years ago
- Official Code for "CMamba: Channel Correlation Enhanced State Space Models for Multivariate Time Series Forecasting"☆45Jan 6, 2025Updated last year
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR)☆12Jul 10, 2022Updated 3 years ago