VITA-Group/SSM-Bottleneck

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VITA-Group/SSM-Bottleneck)

VITA-Group / SSM-Bottleneck

[ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li

☆18

Alternatives and similar repositories for SSM-Bottleneck

Users that are interested in SSM-Bottleneck are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

goombalab / Gather-and-Aggregate
View on GitHub
Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"
☆16Apr 30, 2025Updated last year
psm1206 / DAN
View on GitHub
[SIGIR'25] "Why is Normalization Necessary for Linear Recommenders?"
☆15Dec 11, 2025Updated 7 months ago
srush / mamba-scans
View on GitHub
Blog post
☆17Feb 16, 2024Updated 2 years ago
AnsongLi / TIE-DGNN
View on GitHub
This is the code for the Paper: Transition Information Enhanced Disentangled Graph Neural Networks for Session-based Recommendation
☆14Apr 6, 2022Updated 4 years ago
RakitinDen / pytorch-recursive-gumbel-max-trick
View on GitHub
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021
☆14Dec 11, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
adiSimhi / Interpreting-Embedding-Spaces-by-Conceptualization
View on GitHub
☆15Oct 17, 2023Updated 2 years ago
BlinkDL / LinearAttentionArena
View on GitHub
Here we will test various linear attention designs.
☆62Apr 25, 2024Updated 2 years ago
Huster-Hq / DADA
View on GitHub
[MICCAI 2025 Early Accept] Targeted False Positive Synthesis via Detector-guided Adversarial Diffusion Attacker for Robust Polyp Detectio…
☆15Dec 5, 2025Updated 7 months ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
ShiZhengyan / SelfContrastiveLearningRecSys
View on GitHub
[ECIR 2024] Official repository for the paper titled "Self Contrastive Learning for Session-based Recommendation"
☆21Apr 3, 2024Updated 2 years ago
Doraemonzzz / hgru2-pytorch
View on GitHub
☆24Sep 25, 2024Updated last year
Doraemonzzz / tnn-pytorch
View on GitHub
☆20Apr 17, 2023Updated 3 years ago
EleutherAI / rnngineering
View on GitHub
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33May 25, 2024Updated 2 years ago
Benjamin-Walker / selective-ssms-and-linear-cdes
View on GitHub
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17Jan 7, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
liyaooi / FETCH
View on GitHub
An automated feature engineering framework 'FETCH' accepted in ICLR 2023.
☆12Jun 20, 2023Updated 3 years ago
jin530 / MiaSRec
View on GitHub
This is the official code for SIGIR 2024 paper: 'Multi-intent-aware Session-based Recommendation'.
☆26Mar 21, 2025Updated last year
EmanueleCosenza / NN4G
View on GitHub
A Python implementation of NN4G, a constructive neural network for graphs.
☆13Sep 27, 2021Updated 4 years ago
llm4sr / PO4ISR
View on GitHub
☆15Jun 4, 2024Updated 2 years ago
yuehaowang / lets_CG
View on GitHub
Learning and practice Computer Graphics.
☆11Jan 30, 2023Updated 3 years ago
IVRL / Tempsal
View on GitHub
Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)
☆15Mar 11, 2025Updated last year
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
Zehong-Wang / GPM
View on GitHub
Beyond Message Passing: Neural Graph Pattern Machine, ICML 2025
☆15May 28, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
subho406 / agalite
View on GitHub
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆24Oct 15, 2024Updated last year
fuego-wtf / graphyn-code
View on GitHub
Your AI dev team member, one command away.
☆17Updated this week
ejmichaud / feature-geometry
View on GitHub
Code for "The Geometry of Concepts: Sparse Autoencoder Feature Structure"
☆17Mar 25, 2025Updated last year
distributed-information-bottleneck / distributed-information-bottleneck.github.io
View on GitHub
A repository for using the distributed information bottleneck to locate information in data
☆17Aug 26, 2024Updated last year
lucidrains / quartic-transformer
View on GitHub
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆56Mar 25, 2025Updated last year
wzsong17 / recsys21_insert
View on GitHub
next-item recommendations in short sessions
☆10Sep 24, 2022Updated 3 years ago
jin530 / SLIST
View on GitHub
This is the official code for WWW 2021 paper "Session-aware Linear Item-Item Models for Session-based Recommendation"
☆34Sep 19, 2023Updated 2 years ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
yzh8221 / DiMTS
View on GitHub
[AAAI 2026] The implementation of DiMTS: Bridge the Gap between Selective State Space Models and Time Series for Generative Modeling.
☆15Nov 28, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jyansir / tmlp
View on GitHub
[KDD 2024] Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs
☆12Mar 3, 2025Updated last year
Arhosseini77 / ADDNN_2023
View on GitHub
Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran
☆11Feb 18, 2024Updated 2 years ago
DeepGraphLearning / SPN
View on GitHub
☆29Jul 12, 2022Updated 4 years ago
wannature / Detective-A-Dynamic-Integrated-Uncertainty-Valuation-Framework
View on GitHub
Pytorch implementation of Detective
☆13Jul 11, 2024Updated 2 years ago
SongYanSDU / AugANFIS
View on GitHub
Single-Source Domain Generalization for Bearing Fault Diagnosis Using Feature-Augmented Adaptive Neuro-Fuzzy Inference System
☆12Apr 13, 2024Updated 2 years ago
leiluk1 / gaze-based-segmentation
View on GitHub
Code release for "Gaze-Assisted Medical Image Segmentation" [AIM-FM @ NeurIPS, 2024]
☆14Oct 22, 2024Updated last year
Zyphra / Zamba2
View on GitHub
PyTorch implementation of models from the Zamba2 series.
☆193Jan 23, 2025Updated last year