hustvl / CoStudent
☆11Updated 9 months ago
Alternatives and similar repositories for CoStudent
Users who are interested in CoStudent are comparing it to the repositories listed below.
- ☆16Updated last year
- LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆24Updated last week
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆66Updated 2 months ago
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆56Updated 4 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- The first decoder-only multimodal state space model☆95Updated 3 months ago
- ☆48Updated 3 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆20Updated 5 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆45Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆75Updated 9 months ago
- An mmcv logger hook that can push model results to WeChat☆20Updated 2 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Updated last month
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- ☆31Updated last year
- [IJCV 2024]☆16Updated 9 months ago
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆98Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆48Updated last month
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Updated 10 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆44Updated 7 months ago
- Open-Vocabulary Panoptic Segmentation☆26Updated 2 months ago
- Code release for "Language-conditioned Detection Transformer"☆87Updated last year
- [CVPR 2025] Test-Time Visual In-Context Tuning☆25Updated 5 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆92Updated 2 months ago
- ☆14Updated 2 years ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Updated 2 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated last year