[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention
☆39Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for RALA
Users that are interested in RALA are comparing it to the libraries listed below
Sorting:
- Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration☆26Aug 13, 2025Updated 6 months ago
- ☆26May 28, 2025Updated 9 months ago
- [NeurIPS 2024] Official repository of InLine attention☆59Dec 22, 2024Updated last year
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks☆44Jun 12, 2025Updated 8 months ago
- ☆14Mar 20, 2025Updated 11 months ago
- ☆90May 13, 2025Updated 9 months ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- ☆21Oct 22, 2025Updated 4 months ago
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆32May 28, 2025Updated 9 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 3 months ago
- ☆16May 13, 2025Updated 9 months ago
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆12Jun 19, 2025Updated 8 months ago
- ☆17Aug 7, 2025Updated 6 months ago
- ☆24May 23, 2025Updated 9 months ago
- UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation☆23May 16, 2025Updated 9 months ago
- [ICLR2026] "OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs"☆37Feb 7, 2026Updated 3 weeks ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated 11 months ago
- ☆16Apr 30, 2024Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- [AAAI 2026] Official repository of Circulant Attention☆27Jan 12, 2026Updated last month
- [CVPR 2024] G3DR: Generative 3D Reconstruction in ImageNet☆38Jun 27, 2024Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 3 months ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆27Jul 21, 2025Updated 7 months ago
- Code of paper 'Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training'☆21Jun 10, 2025Updated 8 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated 10 months ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Jun 17, 2024Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆23Oct 22, 2025Updated 4 months ago
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆19Aug 21, 2025Updated 6 months ago
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆25Sep 15, 2024Updated last year
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 11 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Sep 12, 2025Updated 5 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆140Feb 25, 2026Updated last week
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution☆26May 16, 2024Updated last year
- Official Release of "Mixture of Horizons in Action Chunking"☆43Dec 3, 2025Updated 3 months ago