hustvl / CoStudentLinks
☆11Updated 8 months ago
Alternatives and similar repositories for CoStudent
Users that are interested in CoStudent are comparing it to the libraries listed below
Sorting:
- ☆16Updated last year
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆66Updated last month
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆20Updated 2 years ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆33Updated 8 months ago
- ☆30Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆19Updated 4 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 8 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆45Updated last year
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆56Updated 3 months ago
- Featurized Query R-CNN☆45Updated 3 years ago
- ☆47Updated 2 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆14Updated last week
- [IJCV 2024]☆16Updated 8 months ago
- The first decoder-only multimodal state space model☆95Updated 2 months ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆169Updated 5 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆99Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Updated last month
- Code release for "Language-conditioned Detection Transformer"☆87Updated last year
- ☆18Updated last year
- ☆34Updated this week
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆98Updated last year
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆89Updated last month
- ☆30Updated 6 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Updated last year