JiazuoYu / PathWeave
Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024
☆28Updated 3 weeks ago
Alternatives and similar repositories for PathWeave:
Users that are interested in PathWeave are comparing it to the libraries listed below
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆65Updated 2 months ago
- Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.☆56Updated 3 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆66Updated last month
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆20Updated last week
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆63Updated 7 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆95Updated last year
- SeqTR: A Simple yet Universal Network for Visual Grounding☆131Updated 2 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆29Updated 9 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- ☆36Updated 9 months ago
- ☆34Updated last year
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval☆25Updated 2 months ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆60Updated 9 months ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆50Updated last month
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆34Updated 2 months ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆44Updated 6 months ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆96Updated last year
- ☆22Updated 3 months ago
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆54Updated last year
- ☆28Updated last year
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Updated 2 years ago
- FreeVA: Offline MLLM as Training-Free Video Assistant☆54Updated 7 months ago
- [TPAMI under review] Towards Visual Grounding: A Survey☆37Updated this week
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆36Updated 8 months ago
- ☆89Updated last year
- ☆19Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆63Updated 6 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆46Updated 5 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆39Updated last month
- A lightweight codebase for referring expression comprehension and segmentation☆52Updated 2 years ago