isLinXu / VALSE-WorkShopLinks
This project is an unofficial collection of resources related to VALSE and its annual seminar. Its main purpose is to facilitate communication and learning; additions and suggestions are welcome.
☆20Updated last year
Alternatives and similar repositories for VALSE-WorkShop
Users that are interested in VALSE-WorkShop are comparing it to the libraries listed below
- Official PyTorch implementation of Super Vision Transformer (IJCV)☆43Updated 2 years ago
- ☆72Updated 10 months ago
- Unified Architecture Search with Convolution, Transformer, and MLP (ECCV 2022)☆53Updated 3 years ago
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆47Updated last year
- ☆19Updated last month
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Updated last year
- ☆37Updated 3 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated 2 years ago
- [ICCV 23] An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆105Updated 2 years ago
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆30Updated 2 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆71Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30Updated 2 years ago
- A collection of ICCV 2023 papers and code☆18Updated 2 years ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆52Updated 4 months ago
- Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆74Updated 3 years ago
- PyTorch implementation of the ECCV 2022 paper "Knowledge Condensation Distillation": https://arxiv.org/abs/2207.05409☆30Updated 3 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Updated 3 years ago
- ☆11Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆101Updated 6 months ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆56Updated 8 months ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Updated 2 years ago
- ☆53Updated last year
- (AAAI 2023 Oral) PyTorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Updated 3 years ago
- This repo is the official MegEngine implementation of the ECCV 2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆28Updated 3 years ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆64Updated last year
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Updated 3 years ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Updated last year
- Official implementation of the paper "W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection"☆29Updated 3 years ago
- This is the code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated 2 years ago