qhfan / RALA
[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention
☆39Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for RALA
Users who are interested in RALA are comparing it to the repositories listed below
- Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration☆26Aug 13, 2025Updated 6 months ago
- ☆27May 28, 2025Updated 8 months ago
- Official repository of InLine attention (NeurIPS 2024)☆58Dec 22, 2024Updated last year
- ☆14Mar 20, 2025Updated 10 months ago
- ☆90May 13, 2025Updated 9 months ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- ☆20Oct 22, 2025Updated 3 months ago
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆31May 28, 2025Updated 8 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆12Jun 19, 2025Updated 7 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 2 months ago
- ☆16May 13, 2025Updated 9 months ago
- ☆17Aug 7, 2025Updated 6 months ago
- ☆24May 23, 2025Updated 8 months ago
- Official repository of Circulant Attention (AAAI 2026)☆19Jan 12, 2026Updated last month
- UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation☆22May 16, 2025Updated 8 months ago
- [ICLR2026] "OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs"☆37Updated this week
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- ☆16Apr 30, 2024Updated last year
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated 11 months ago
- [CVPR 2024] G3DR: Generative 3D Reconstruction in ImageNet☆37Jun 27, 2024Updated last year
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆28Jul 21, 2025Updated 6 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 2 months ago
- Code of paper 'Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training'☆21Jun 10, 2025Updated 8 months ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 9 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated 9 months ago
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆24Sep 15, 2024Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆115Jun 17, 2024Updated last year
- Official Release of "Mixture of Horizons in Action Chunking"☆40Dec 3, 2025Updated 2 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆23Oct 22, 2025Updated 3 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Sep 12, 2025Updated 5 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆19Aug 21, 2025Updated 5 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 10 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated 11 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆137Dec 19, 2025Updated last month
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆23Mar 13, 2025Updated 11 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆21Jun 2, 2025Updated 8 months ago