[ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li
☆17Mar 21, 2025Updated last year
Alternatives and similar repositories for SSM-Bottleneck
Users who are interested in SSM-Bottleneck are comparing it to the libraries listed below.
- Blog post☆17Feb 16, 2024Updated 2 years ago
- This is the code for the Paper: Transition Information Enhanced Disentangled Graph Neural Networks for Session-based Recommendation☆14Apr 6, 2022Updated 3 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- ☆14Oct 17, 2023Updated 2 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆14Dec 11, 2021Updated 4 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- [ECIR 2024] Official repository for the paper titled "Self Contrastive Learning for Session-based Recommendation"☆21Apr 3, 2024Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- ☆20Apr 17, 2023Updated 2 years ago
- ☆24Sep 25, 2024Updated last year
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆14Nov 24, 2025Updated 4 months ago
- [AAAI 2026] Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing☆36Jan 27, 2026Updated last month
- ☆47Nov 8, 2024Updated last year
- An automated feature engineering framework 'FETCH' accepted in ICLR 2023.☆11Jun 20, 2023Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆18Feb 1, 2026Updated last month
- Triton implementation of bi-directional (non-causal) linear attention☆71Mar 1, 2026Updated 3 weeks ago
- This is the official code for SIGIR 2024 paper: 'Multi-intent-aware Session-based Recommendation'.☆25Mar 21, 2025Updated last year
- A Python implementation of NN4G, a constructive neural network for graphs.☆13Sep 27, 2021Updated 4 years ago
- Learning and practicing computer graphics.☆11Jan 30, 2023Updated 3 years ago
- Transformers components but in Triton☆34May 9, 2025Updated 10 months ago
- [AAAI 2026] Official repository of the EMAformer paper: "EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecas…☆35Dec 3, 2025Updated 3 months ago
- Official implementation of "KARMA: A Multilevel Decomposition Hybrid Mamba Framework for Multivariate Long-Term Time Series Forecas…☆20Jul 15, 2025Updated 8 months ago
- A repository for using the distributed information bottleneck to locate information in data☆16Aug 26, 2024Updated last year
- [AAAI'25] "Decomposed Spatio-Temporal Mamba for Long-Term Traffic Prediction"☆19May 30, 2025Updated 9 months ago
- ☆15Jun 4, 2024Updated last year
- This is the official code for WWW 2021 paper "Session-aware Linear Item-Item Models for Session-based Recommendation"☆33Sep 19, 2023Updated 2 years ago
- [KDD 2024] Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs☆11Mar 3, 2025Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆15Mar 11, 2025Updated last year
- [AAAI'2025] The official implementation code of SIGMA☆39Oct 14, 2025Updated 5 months ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Single-Source Domain Generalization for Bearing Fault Diagnosis Using Feature-Augmented Adaptive Neuro-Fuzzy Inference System☆11Apr 13, 2024Updated last year
- The official dataset of the flowvqa project.☆21Mar 26, 2024Updated 2 years ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs.☆12Apr 24, 2025Updated 11 months ago
- ☆29Jul 12, 2022Updated 3 years ago
- Code for "Are Powerful Graph Neural Nets Necessary? A Dissection on Graph Classification"☆54May 5, 2020Updated 5 years ago
- next-item recommendations in short sessions☆10Sep 24, 2022Updated 3 years ago
- Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]☆14Oct 22, 2024Updated last year