[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
☆543Feb 18, 2025Updated last year
Alternatives and similar repositories for Vision-RWKV
Users that are interested in Vision-RWKV are comparing it to the libraries listed below
Sorting:
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆244Jan 13, 2026Updated 2 months ago
- A curated list of papers on the applications of RWKV in computer vision. Please raise an issue if you suggest new qualified project.☆241Jun 19, 2025Updated 9 months ago
- Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV☆130Jul 28, 2025Updated 7 months ago
- Scaling RWKV-Like Architectures for Diffusion Models☆144Apr 12, 2024Updated last year
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆153Dec 14, 2025Updated 3 months ago
- VMamba: Visual State Space Models,code is based on mamba☆3,079Mar 7, 2025Updated last year
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,820Feb 13, 2025Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Jun 17, 2024Updated last year
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆1,029Aug 4, 2025Updated 7 months ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆1,087Jul 6, 2024Updated last year
- xLSTM as Generic Vision Backbone☆491Oct 20, 2025Updated 5 months ago
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"☆13Oct 18, 2024Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆277May 6, 2024Updated last year
- Code Implementation of EfficientVMamba☆244Apr 16, 2024Updated last year
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,419Mar 5, 2026Updated 2 weeks ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks☆125Aug 16, 2024Updated last year
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,476Jun 3, 2025Updated 9 months ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆392Jul 9, 2024Updated last year
- Mamba SSM architecture☆17,524Updated this week
- [CVPR 2024] Deformable Convolution v4☆714May 17, 2024Updated last year
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆133Jul 20, 2024Updated last year
- ☆23Dec 28, 2024Updated last year
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆2,065Mar 11, 2026Updated last week
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 4 months ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆925Apr 17, 2024Updated last year
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications☆748Jun 28, 2025Updated 8 months ago
- Awesome Papers related to Mamba.☆1,393Oct 17, 2024Updated last year
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- ☆73Aug 1, 2025Updated 7 months ago
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆734Feb 18, 2025Updated last year
- 🚀 Efficient implementations of state-of-the-art linear attention models☆4,630Updated this week
- [NeurIPS 2024] Official repository of MLLA☆373Jul 11, 2025Updated 8 months ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Jun 1, 2023Updated 2 years ago
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆185Jun 12, 2025Updated 9 months ago
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆111Aug 5, 2025Updated 7 months ago
- EVE Series: Encoder-Free Vision-Language Models from BAAI☆368Jul 24, 2025Updated 7 months ago
- [ECCV2024, CVPR2025] MambaIR and MambaIRv2!☆1,043Apr 15, 2025Updated 11 months ago
- [CVPR 2024 & TPAMI 2025] UniRepLKNet☆1,071Aug 10, 2025Updated 7 months ago