Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
☆14Apr 25, 2024Updated last year
Alternatives and similar repositories for DAM
Users that are interested in DAM are comparing it to the libraries listed below
Sorting:
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 2 months ago
- [TCSVT] Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection☆17Jul 22, 2023Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Jun 7, 2024Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Mar 13, 2026Updated last week
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 6 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆56Jul 1, 2025Updated 8 months ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆57Jul 25, 2023Updated 2 years ago
- A Pytorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation☆15Nov 28, 2023Updated 2 years ago
- [NeurIPS 2022] JAX/Haiku implementation of "On Privacy and Personalization in Cross-Silo Federated Learning"☆27Apr 16, 2023Updated 2 years ago
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering☆197Jan 14, 2024Updated 2 years ago
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- Thermal Indoor Motion Dataset☆14Apr 27, 2023Updated 2 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆22Feb 23, 2025Updated last year
- ☆58Dec 2, 2025Updated 3 months ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 8 months ago
- ☆23Sep 19, 2024Updated last year
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 6 years ago
- ☐ ☐ A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`☆10May 28, 2021Updated 4 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- ☆35Oct 27, 2025Updated 4 months ago
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆21Updated this week
- NeurIPS23 "Flow Factorized Representation Learning"☆43Dec 15, 2025Updated 3 months ago
- The Continual Learning in Multimodality Benchmark☆68Jun 24, 2023Updated 2 years ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆25Apr 14, 2025Updated 11 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 2 months ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆55Oct 21, 2025Updated 5 months ago
- Video-Language Continual Learning Benchmark☆20Oct 30, 2024Updated last year
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- ☆70Dec 5, 2025Updated 3 months ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆25Sep 11, 2024Updated last year
- Traffic Video Event Retrieval via Text Query using Vehicle Appearance and Motion Attributes☆10Jun 21, 2021Updated 4 years ago
- ☆17Jul 23, 2024Updated last year
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- Official code for the paper "Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network"☆16Aug 9, 2023Updated 2 years ago