Itamarzimm/UnifiedImplicitAttnRepr

[ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation

☆50

Alternatives and similar repositories for UnifiedImplicitAttnRepr

Users who are interested in UnifiedImplicitAttnRepr are comparing it to the repositories listed below.

AmeenAli / HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆232 · Oct 16, 2025 · Updated 8 months ago
SR0920 / TEC-Net
☆13 · Jul 11, 2025 · Updated 11 months ago
FarnoushRJ / MambaLRP
[NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍
☆47 · Nov 6, 2024 · Updated last year
Steve-Tod / STFC3
☆12 · Dec 26, 2021 · Updated 4 years ago
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year
kdu4108 / semiring-backprop-exps
☆16 · Jul 10, 2023 · Updated 2 years ago
yasumasaonoe / ecbd
☆11 · Apr 23, 2023 · Updated 3 years ago
srush / mamba-primer
☆39 · Apr 5, 2024 · Updated 2 years ago
deep-spin / spectra-rationalization
Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.
☆10 · Feb 14, 2024 · Updated 2 years ago
AIoT-MLSys-Lab / Famba-V
[ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
☆34 · Sep 30, 2024 · Updated last year
ElementAI / lagr
LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing
☆10 · Jun 1, 2022 · Updated 4 years ago
jiazhihao / attention_superoptimizer
An Attention Superoptimizer
☆22 · Jan 20, 2025 · Updated last year
LZhengisme / CODA
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11 · Sep 16, 2021 · Updated 4 years ago
OpenMOSE / RWKV-Infer
A large-scale RWKV v7 (World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy…
☆49 · Oct 21, 2025 · Updated 8 months ago
mossr / TikzNeuralNetworks.jl
Visualize neural networks using TikZ in Julia
☆15 · Jan 29, 2025 · Updated last year
IBM / selective-dense-state-space-model
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16 · Sep 18, 2025 · Updated 9 months ago
wwy155 / FAN
☆25 · Nov 26, 2024 · Updated last year
petezh / OpenD5
Tasks for describing differences between text distributions.
☆17 · Aug 9, 2024 · Updated last year
HazyResearch / prefix-linear-attention
☆62 · Jul 9, 2024 · Updated 2 years ago
foundation-model-research / NeuronPath
☆17 · Feb 23, 2025 · Updated last year
GATECH-EIC / HALO
The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
☆10 · Mar 22, 2023 · Updated 3 years ago
automl / unlocking_state_tracking
Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…
☆22 · Mar 15, 2025 · Updated last year
caoluyang0830 / CVBM
☆16 · May 23, 2025 · Updated last year
IVY-LVLM / Video-MA2MBA
Official Implementation of Video-MA2MBA
☆12 · Dec 3, 2024 · Updated last year
chenydong / O-Mamba
The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"
☆14 · Oct 18, 2024 · Updated last year
NicolasZucchet / minimal-LRU
Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)
☆62 · Sep 3, 2025 · Updated 10 months ago
ag1988 / mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆21 · Oct 11, 2024 · Updated last year
leefly072 / DRCNet
☆12 · Jul 26, 2022 · Updated 3 years ago
IST-DASLab / peft-rosa
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
☆15 · Aug 16, 2024 · Updated last year
ImKeTT / AutoRec-Pytorch
[Tool] AutoRec (2015) PyTorch Implementation
☆10 · Mar 1, 2020 · Updated 6 years ago
microsoft / distilled_decoding
[ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
☆19 · Apr 21, 2025 · Updated last year
hyang0511 / SR_Viewer
This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.
☆11 · Jan 6, 2023 · Updated 3 years ago
pysat / pysatSpaceWeather
pysat support for space weather indices and data sets
☆14 · Apr 3, 2026 · Updated 3 months ago
zhouziyu02 / SDformer
Official implementation for the IJCAI'24 paper: SDformer
☆34 · Mar 6, 2025 · Updated last year
FarnoushRJ / RelP
[NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…
☆29 · Nov 3, 2025 · Updated 8 months ago
wtdcode / mdbx-remote
Access your MDBX database over the network safely
☆27 · Jan 6, 2026 · Updated 6 months ago
Infini-AI-Lab / Sirius
Sirius, an efficient correction mechanism that significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…
☆21 · Sep 10, 2024 · Updated last year
alipay / L3TC-leveraging-rwkv-for-learned-lossless-low-complexity-text-compression
☆18 · Apr 14, 2025 · Updated last year
bifold-pathomics / xMIL
xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology
☆36 · Apr 27, 2026 · Updated 2 months ago