tomchen-ctj / CVPR23-LOVEU-AQTCLinks
【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge
☆15Updated 2 years ago
Alternatives and similar repositories for CVPR23-LOVEU-AQTC
Users that are interested in CVPR23-LOVEU-AQTC are comparing it to the libraries listed below
Sorting:
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Updated last year
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆25Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Updated 9 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆31Updated last year
- ☆13Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆45Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Updated last year
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆27Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated 2 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆23Updated 4 months ago
- ☆41Updated 5 months ago
- ☆13Updated 9 months ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- ☆22Updated 9 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Updated 2 years ago
- ☆12Updated 2 years ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Updated 3 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆27Updated 3 months ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆28Updated last year
- Test-Time Training on Video Streams☆66Updated 2 years ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆25Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆94Updated 11 months ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Updated last year
- ☆32Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13Updated 2 years ago
- ☆16Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Updated last year