FarnoushRJ / MambaLRP
[NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".
☆38Updated 4 months ago
Alternatives and similar repositories for MambaLRP:
Users that are interested in MambaLRP are comparing it to the libraries listed below
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆216Updated 9 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆125Updated last month
- More dimensions = More fun☆21Updated 7 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆36Updated last month
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆40Updated 3 weeks ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆49Updated 9 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆98Updated 6 months ago
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy☆66Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248☆50Updated 9 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆63Updated 3 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆40Updated 11 months ago
- State Space Models☆66Updated 10 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆62Updated 5 months ago
- Implementations of various linear RNN layers using pytorch and triton☆50Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆51Updated last month
- ☆53Updated last month
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Updated this week
- HGRN2: Gated Linear RNNs with State Expansion☆53Updated 7 months ago
- The official Pytorch implementation of the paper "Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT …☆34Updated last year
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆58Updated 8 months ago
- Official implementation for "Targeted Cause Discovery with Data-Driven Learning"☆23Updated 6 months ago
- TorchDR - PyTorch Dimensionality Reduction☆96Updated last month
- ☆31Updated 10 months ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆77Updated last year
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆71Updated last year
- Holistic evaluation of multimodal foundation models☆42Updated 7 months ago
- ☆52Updated 5 months ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆18Updated 4 months ago