Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
☆55Oct 21, 2025Updated 4 months ago
Alternatives and similar repositories for vid-TLDR
Users that are interested in vid-TLDR are comparing it to the libraries listed below
Sorting:
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆12Mar 10, 2025Updated 11 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆32Mar 10, 2025Updated 11 months ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 7 months ago
- ☆12Jan 4, 2022Updated 4 years ago
- ☆11Jan 4, 2022Updated 4 years ago
- 2021 Drone AI challenge☆16Jan 4, 2022Updated 4 years ago
- ☆16Jun 5, 2023Updated 2 years ago
- Official PyTorch Implementation for Advancing Bayesian Optimization via Learning Correlated Latent Space (CoBO)☆18Apr 22, 2025Updated 10 months ago
- ☆16Jun 5, 2023Updated 2 years ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 4 months ago
- [ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models☆67May 15, 2025Updated 9 months ago
- Archive for AI grand challenge☆20Jun 6, 2023Updated 2 years ago
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆57Feb 2, 2026Updated last month
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Jan 17, 2026Updated last month
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated last year
- ☆16Jun 5, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆42Apr 7, 2024Updated last year
- ☆13May 15, 2025Updated 9 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…☆27Jun 24, 2024Updated last year
- Official Implementation (PyTorch) of "SageMix: Saliency-Guided Mixup for Point Clouds", NeurIPS 2022☆25Jul 19, 2023Updated 2 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Nov 22, 2022Updated 3 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆24Jan 26, 2025Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated last month
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- ☆12Dec 15, 2023Updated 2 years ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 5 months ago
- [ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval☆79Nov 29, 2022Updated 3 years ago
- Deformable Graph Convolutional Networks (Author's PyTorch implementation for the AAAI 2022 paper)☆27Sep 22, 2022Updated 3 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆14Nov 28, 2022Updated 3 years ago
- (EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives☆37Dec 20, 2025Updated 2 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆45Dec 11, 2023Updated 2 years ago
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆40Mar 16, 2025Updated 11 months ago
- ☆16Jul 7, 2023Updated 2 years ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆42Oct 28, 2025Updated 4 months ago
- Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relati…☆41Apr 19, 2024Updated last year