[ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation
β49Mar 1, 2025Updated last year
Alternatives and similar repositories for UnifiedImplicitAttnRepr
Users that are interested in UnifiedImplicitAttnRepr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β13Jul 11, 2025Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" πβ45Nov 6, 2024Updated last year
- β12Dec 26, 2021Updated 4 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdfβ21Jul 29, 2024Updated last year
- β19Dec 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- β16Jul 10, 2023Updated 2 years ago
- β39Apr 5, 2024Updated last year
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.β10Feb 14, 2024Updated 2 years ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusionβ34Sep 30, 2024Updated last year
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsingβ10Jun 1, 2022Updated 3 years ago
- Implementation of Cascaded Head-colliding Attention (ACL'2021)β11Sep 16, 2021Updated 4 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deployβ¦β48Oct 21, 2025Updated 5 months ago
- Visualize neural networks using TikZ in Juliaβ15Jan 29, 2025Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on β¦β16Sep 18, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generationβ14Mar 27, 2024Updated 2 years ago
- β58Jul 9, 2024Updated last year
- Tasks for describing differences between text distributions.β17Aug 9, 2024Updated last year
- β16Feb 23, 2025Updated last year
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"β10Mar 22, 2023Updated 3 years ago
- Time2Feat: Learning Interpretable Representations for Multivariate Time Series Clusteringβ30Oct 9, 2025Updated 5 months ago
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling withoutβ¦β21Mar 15, 2025Updated last year
- Official Implementation of Video-MA2MBAβ12Dec 3, 2024Updated last year
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)β62Sep 3, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, Georgeβ¦β20Oct 11, 2024Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)β15Aug 16, 2024Updated last year
- β12Jul 26, 2022Updated 3 years ago
- A PyTorch implementation of SIN.β12Oct 20, 2021Updated 4 years ago
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.β11Jan 6, 2023Updated 3 years ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matchingβ20Apr 21, 2025Updated 11 months ago
- Access your MDBX database over network safelyβ27Jan 6, 2026Updated 2 months ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining itsβ¦β21Sep 10, 2024Updated last year
- [ICCV2025]Generate one 2K image on single 24GB 3090 GPU!β84Sep 8, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β30Oct 20, 2021Updated 4 years ago
- β51Jan 28, 2024Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learninβ¦β21Feb 9, 2026Updated last month
- Implementation of the paper End-to-end Learning of Deterministic Decision Treesβ17May 19, 2022Updated 3 years ago
- Code for MambaVC: Learned Visual Compression with Selective State Spacesβ62May 29, 2024Updated last year
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spacesβ12Apr 19, 2023Updated 2 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"β13Jun 14, 2023Updated 2 years ago