[ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation
β49Mar 1, 2025Updated last year
Alternatives and similar repositories for UnifiedImplicitAttnRepr
Users that are interested in UnifiedImplicitAttnRepr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β13Jul 11, 2025Updated 9 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" πβ47Nov 6, 2024Updated last year
- β12Dec 26, 2021Updated 4 years ago
- β16Jul 10, 2023Updated 2 years ago
- β39Apr 5, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.β10Feb 14, 2024Updated 2 years ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusionβ34Sep 30, 2024Updated last year
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsingβ10Jun 1, 2022Updated 3 years ago
- An Attention Superoptimizerβ22Jan 20, 2025Updated last year
- Implementation of Cascaded Head-colliding Attention (ACL'2021)β11Sep 16, 2021Updated 4 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deployβ¦β49Oct 21, 2025Updated 6 months ago
- Visualize neural networks using TikZ in Juliaβ15Jan 29, 2025Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on β¦β16Sep 18, 2025Updated 7 months ago
- β20Nov 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generationβ14Mar 27, 2024Updated 2 years ago
- β59Jul 9, 2024Updated last year
- Educational Implementation of "Edit Flows: Flow Matching with Edit Operations" by Havasi et al.β41Oct 17, 2025Updated 6 months ago
- Tasks for describing differences between text distributions.β17Aug 9, 2024Updated last year
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"β10Mar 22, 2023Updated 3 years ago
- Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling withoutβ¦β21Mar 15, 2025Updated last year
- Official Implementation of Video-MA2MBAβ12Dec 3, 2024Updated last year
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"β14Oct 18, 2024Updated last year
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, Georgeβ¦β20Oct 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)β15Aug 16, 2024Updated last year
- A PyTorch implementation of SIN.β12Oct 20, 2021Updated 4 years ago
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.β11Jan 6, 2023Updated 3 years ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matchingβ19Apr 21, 2025Updated last year
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β28Nov 3, 2025Updated 6 months ago
- Access your MDBX database over network safelyβ27Jan 6, 2026Updated 4 months ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining itsβ¦β21Sep 10, 2024Updated last year
- β16Apr 14, 2025Updated last year
- β30Oct 20, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β52Jan 28, 2024Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learninβ¦β21Apr 20, 2026Updated 2 weeks ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spacesβ12Apr 19, 2023Updated 3 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"β13Jun 14, 2023Updated 2 years ago
- Code for MambaVC: Learned Visual Compression with Selective State Spacesβ62May 29, 2024Updated last year
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Modelsβ14Jan 4, 2025Updated last year
- Repository for the ICML 2021 paper: https://arxiv.org/abs/2103.04886β13Jan 24, 2022Updated 4 years ago