[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
☆541 · Feb 18, 2025 · Updated last year
Alternatives and similar repositories for Vision-RWKV
Users that are interested in Vision-RWKV are comparing it to the libraries listed below
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks. ☆245 · Jan 13, 2026 · Updated last month
- A curated list of papers on the applications of RWKV in computer vision. Please raise an issue to suggest a new qualifying project. ☆240 · Jun 19, 2025 · Updated 8 months ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner ☆153 · Dec 14, 2025 · Updated 2 months ago
- Scaling RWKV-Like Architectures for Diffusion Models ☆143 · Apr 12, 2024 · Updated last year
- VMamba: Visual State Space Models; code is based on Mamba ☆3,046 · Mar 7, 2025 · Updated 11 months ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model ☆3,799 · Feb 13, 2025 · Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆115 · Jun 17, 2024 · Updated last year
- [ECCV 2024] VideoMamba: State Space Model for Efficient Video Understanding ☆1,081 · Jul 6, 2024 · Updated last year
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM". ☆1,028 · Aug 4, 2025 · Updated 6 months ago
- xLSTM as Generic Vision Backbone ☆491 · Oct 20, 2025 · Updated 4 months ago
- Code Implementation of EfficientVMamba ☆243 · Apr 16, 2024 · Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆274 · May 6, 2024 · Updated last year
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks ☆390 · Jul 9, 2024 · Updated last year
- The official implementation of ADDP (ICLR 2024) ☆12 · Mar 27, 2024 · Updated last year
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions ☆1,474 · Jun 3, 2025 · Updated 8 months ago
- EVE Series: Encoder-Free Vision-Language Models from BAAI ☆368 · Jul 24, 2025 · Updated 7 months ago
- [CVPR 2024] Deformable Convolution v4 ☆707 · May 17, 2024 · Updated last year
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications ☆744 · Jun 28, 2025 · Updated 8 months ago
- Here we will test various linear attention designs. ☆62 · Apr 25, 2024 · Updated last year
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone ☆2,034 · Feb 9, 2026 · Updated 2 weeks ago
- Mamba SSM architecture ☆17,257 · Feb 18, 2026 · Updated last week
- 🚀 Efficient implementations of state-of-the-art linear attention models ☆4,428 · Updated this week
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP) ☆109 · Aug 5, 2025 · Updated 6 months ago
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks ☆123 · Aug 16, 2024 · Updated last year
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆14,375 · Updated this week
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement" ☆13 · Oct 18, 2024 · Updated last year
- Awesome Papers related to Mamba. ☆1,390 · Oct 17, 2024 · Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24] ☆1,342 · Oct 15, 2025 · Updated 4 months ago
- Repository of Vision Transformer with Deformable Attention (CVPR 2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Atte… ☆926 · Apr 17, 2024 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,936 · Aug 15, 2024 · Updated last year
- [Official Repo] Visual Mamba: A Survey and New Outlooks ☆731 · Feb 18, 2025 · Updated last year
- [NeurIPS 2024] Official repository of MLLA ☆371 · Jul 11, 2025 · Updated 7 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod… ☆8,626 · Nov 10, 2025 · Updated 3 months ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of … ☆505 · Aug 9, 2024 · Updated last year
- Official Codes and Pretrained Models for RecursiveMix ☆22 · Apr 24, 2023 · Updated 2 years ago
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation ☆184 · Jun 12, 2025 · Updated 8 months ago
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training ☆227 · Mar 20, 2025 · Updated 11 months ago
- Large-scale inference for RWKV v7 (World, PRWKV, Hybrid-RWKV). Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy… ☆47 · Oct 21, 2025 · Updated 4 months ago