[ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation
★ 49 · Mar 1, 2025 · Updated last year
Alternatives and similar repositories for UnifiedImplicitAttnRepr
Users that are interested in UnifiedImplicitAttnRepr are comparing it to the libraries listed below.
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models" ★ 232 · Oct 16, 2025 · Updated 6 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" ★ 46 · Nov 6, 2024 · Updated last year
- ★ 12 · Dec 26, 2021 · Updated 4 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf ★ 21 · Jul 29, 2024 · Updated last year
- ★ 11 · Apr 23, 2023 · Updated 2 years ago
- ★ 39 · Apr 5, 2024 · Updated 2 years ago
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference. ★ 10 · Feb 14, 2024 · Updated 2 years ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion ★ 34 · Sep 30, 2024 · Updated last year
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing ★ 10 · Jun 1, 2022 · Updated 3 years ago
- Implementation of Cascaded Head-colliding Attention (ACL'2021) ★ 11 · Sep 16, 2021 · Updated 4 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy… ★ 48 · Oct 21, 2025 · Updated 5 months ago
- Visualize neural networks using TikZ in Julia ★ 15 · Jan 29, 2025 · Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on … ★ 16 · Sep 18, 2025 · Updated 6 months ago
- ★ 58 · Jul 9, 2024 · Updated last year
- Tasks for describing differences between text distributions. ★ 17 · Aug 9, 2024 · Updated last year
- ★ 16 · Feb 23, 2025 · Updated last year
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ★ 10 · Mar 22, 2023 · Updated 3 years ago
- Time2Feat: Learning Interpretable Representations for Multivariate Time Series Clustering ★ 31 · Oct 9, 2025 · Updated 6 months ago
- Official Implementation of Video-MA2MBA ★ 12 · Dec 3, 2024 · Updated last year
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without… ★ 21 · Mar 15, 2025 · Updated last year
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement" ★ 14 · Oct 18, 2024 · Updated last year
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ★ 62 · Sep 3, 2025 · Updated 7 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George… ★ 20 · Oct 11, 2024 · Updated last year
- ★ 12 · Jul 26, 2022 · Updated 3 years ago
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks. ★ 11 · Jan 6, 2023 · Updated 3 years ago
- [Tool] AutoRec (2015) PyTorch Implementation ★ 10 · Mar 1, 2020 · Updated 6 years ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ★ 19 · Apr 21, 2025 · Updated 11 months ago
- pysat support for space weather indices and data sets ★ 14 · Apr 3, 2026 · Updated last week
- Official implementation for the IJCAI'24 paper: SDformer ★ 31 · Mar 6, 2025 · Updated last year
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La… ★ 27 · Nov 3, 2025 · Updated 5 months ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its… ★ 21 · Sep 10, 2024 · Updated last year
- ★ 17 · Apr 14, 2025 · Updated last year
- xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology ★ 28 · Feb 25, 2026 · Updated last month
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ★ 20 · Mar 28, 2026 · Updated 2 weeks ago
- ★ 51 · Oct 9, 2025 · Updated 6 months ago
- Official repository of "Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection" [ICCV 2025] ★ 21 · Jan 17, 2026 · Updated 2 months ago
- Implementation of the paper End-to-end Learning of Deterministic Decision Trees ★ 17 · May 19, 2022 · Updated 3 years ago
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models ★ 14 · Jan 4, 2025 · Updated last year
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression" ★ 13 · Jun 14, 2023 · Updated 2 years ago