[ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li
☆17Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for SSM-Bottleneck
Users who are interested in SSM-Bottleneck are comparing it to the repositories listed below
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆14Dec 11, 2021Updated 4 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- This is the code for the Paper: Transition Information Enhanced Disentangled Graph Neural Networks for Session-based Recommendation☆14Apr 6, 2022Updated 3 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- ☆20Apr 17, 2023Updated 2 years ago
- [ECIR 2024] Official repository for the paper titled "Self Contrastive Learning for Session-based Recommendation"☆21Apr 3, 2024Updated last year
- ☆24Sep 25, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- Triton implementation of bi-directional (non-causal) linear attention☆70Feb 22, 2026Updated last week
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- This is the official code for SIGIR 2024 paper: 'Multi-intent-aware Session-based Recommendation'.☆24Mar 21, 2025Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- The official code for "A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction"☆30Sep 27, 2024Updated last year
- ☆29Jul 12, 2022Updated 3 years ago
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- [AAAI'2025] The official implementation code of SIGMA☆39Oct 14, 2025Updated 4 months ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- This is the official code for WWW 2021 paper "Session-aware Linear Item-Item Models for Session-based Recommendation"☆33Sep 19, 2023Updated 2 years ago
- University of Chinese Academy of Sciences, Taiji Laboratory: 2025 "Undergraduate Innovation Practice Training Program"☆14Apr 1, 2025Updated 11 months ago
- ☆17Feb 1, 2026Updated last month
- ☆13Jun 18, 2025Updated 8 months ago
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling☆81Apr 24, 2024Updated last year
- This is the official code for the WSDM 2022 paper: 'S-Walk: Accurate and Scalable Session-based Recommendation with Random Walks'.☆33Sep 19, 2023Updated 2 years ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- Official code for "STaSy: Score-based Tabular data Synthesis", ICLR 2023☆34Aug 11, 2023Updated 2 years ago
- Single-Source Domain Generalization for Bearing Fault Diagnosis Using Feature-Augmented Adaptive Neuro-Fuzzy Inference System☆11Apr 13, 2024Updated last year
- Official Code for "CMamba: Channel Correlation Enhanced State Space Models for Multivariate Time Series Forecasting"☆46Jan 6, 2025Updated last year
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆14Nov 24, 2025Updated 3 months ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆11Jan 7, 2025Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- The official PyTorch implementation of "An Attentional Multi-scale Co-evolving Model for Dynamic Link Prediction" (TheWebConf'23)☆11May 4, 2023Updated 2 years ago
- This is the code for the paper: Robust Mid-Pass Filtering Graph Convolutional Networks (accepted at WWW 2023)☆13Feb 17, 2023Updated 3 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Graphical intuition to MOSFET square-law☆11Jan 5, 2021Updated 5 years ago