showlab / Efficient-CLS
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
☆17Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for Efficient-CLS
- ☆72Updated 6 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆19Updated last month
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆55Updated last month
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆52Updated this week
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆30Updated last month
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆40Updated last month
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- ☆22Updated last year
- ☆58Updated last year
- ☆10Updated 2 weeks ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆23Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆28Updated 5 months ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆38Updated 8 months ago
- ☆57Updated last year
- ☆55Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆31Updated last year
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆69Updated 4 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆47Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆49Updated last year
- ☆21Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- ☆44Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆34Updated 3 weeks ago