Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024
☆41Dec 18, 2024Updated last year
Alternatives and similar repositories for PathWeave
Users that are interested in PathWeave are comparing it to the libraries listed below
Sorting:
- ☆12Dec 19, 2024Updated last year
- Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024☆270Sep 18, 2025Updated 5 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆69Oct 15, 2024Updated last year
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆38Oct 9, 2025Updated 4 months ago
- ☆13Jul 10, 2024Updated last year
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆12Oct 16, 2024Updated last year
- ☆18Apr 20, 2025Updated 10 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance☆17Jan 17, 2025Updated last year
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 8 months ago
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation☆15Dec 5, 2023Updated 2 years ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- ☆28Apr 8, 2025Updated 10 months ago
- MR. Video: MapReduce is the Principle for Long Video Understanding☆30Apr 23, 2025Updated 10 months ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability☆106Nov 28, 2024Updated last year
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Nov 21, 2023Updated 2 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆23Jan 26, 2025Updated last year
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆31Jun 5, 2025Updated 8 months ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆41Aug 4, 2025Updated 6 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei …☆35Jun 3, 2025Updated 8 months ago
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025)☆44Nov 24, 2025Updated 3 months ago
- Instruction Tuning in Continual Learning paradigm☆71Feb 5, 2025Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Apr 9, 2024Updated last year
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos☆43Nov 5, 2025Updated 3 months ago
- [AAAI25] Asymmetric Siamese Network for Efficient Visual Tracking☆42Mar 4, 2025Updated 11 months ago
- High Quality Video Reasoning Segmentation☆146Nov 24, 2025Updated 3 months ago
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆83May 13, 2025Updated 9 months ago
- CVPR 2024 (Highlight)☆143Oct 20, 2024Updated last year
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆30Apr 7, 2023Updated 2 years ago
- RefVOS☆29Feb 3, 2021Updated 5 years ago
- Fast and general video object segmentation evaluation.☆36Jan 30, 2024Updated 2 years ago