isLinXu / VALSE-WorkShopLinks
This project is an unofficial summary of resources related to VALSE and its annual seminar. Its main purpose is to facilitate communication and learning, and additions and suggestions are welcome.
☆19Updated last year
Alternatives and similar repositories for VALSE-WorkShop
Users that are interested in VALSE-WorkShop are comparing it to the libraries listed below
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆32Updated 6 months ago
- ☆73Updated 4 months ago
- Official PyTorch implementation of Super Vision Transformer (IJCV)☆43Updated last year
- ☆11Updated 6 months ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Updated last year
- ☆28Updated 3 months ago
- Code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆25Updated last week
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆46Updated 7 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆78Updated 3 months ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆33Updated 2 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆69Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- ☆69Updated 2 months ago
- A collection of ICCV 2023 papers with code☆18Updated last year
- ☆12Updated last year
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆30Updated 2 years ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆40Updated 5 months ago
- Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆24Updated 2 months ago
- ☆77Updated last year
- ☆45Updated 6 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆60Updated 8 months ago
- [ICCV 23] An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆99Updated 2 years ago
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆28Updated 7 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆54Updated last year
- An Empirical Study of GPT-4o Image Generation Capabilities☆24Updated 3 months ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated 2 years ago
- Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆72Updated 3 years ago
- ☆37Updated 2 years ago