Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
☆55Oct 21, 2025Updated 5 months ago
Alternatives and similar repositories for vid-TLDR
Users that are interested in vid-TLDR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆12Mar 10, 2025Updated last year
- [NeurIPS 24] Official Implementation (Pytorch) of "Inversion-based Latent Bayesian Optimization"☆10Nov 15, 2024Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆32Mar 10, 2025Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆40Jul 30, 2025Updated 8 months ago
- ☆12Jan 4, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jan 4, 2022Updated 4 years ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 5 months ago
- Official PyTorch Implementation for Advancing Bayesian Optimization via Learning Correlated Latent Space (CoBO)☆18Apr 22, 2025Updated 11 months ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Nov 22, 2022Updated 3 years ago
- Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…☆27Jun 24, 2024Updated last year
- ☆16Jun 5, 2023Updated 2 years ago
- Official Implementation (PyTorch) of "SageMix: Saliency-Guided Mixup for Point Clouds", NeurIPS 2022☆25Jul 19, 2023Updated 2 years ago
- ☆16Jun 5, 2023Updated 2 years ago
- Archive for AI grand challenge☆20Jun 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated last year
- ☆16Jun 5, 2023Updated 2 years ago
- [ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models☆71May 15, 2025Updated 10 months ago
- Deformable Graph Convolutional Networks (Author's PyTorch implementation for the AAAI 2022 paper)☆27Sep 22, 2022Updated 3 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆25Jan 26, 2025Updated last year
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆60Feb 2, 2026Updated 2 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)☆78Mar 26, 2025Updated last year
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆15Nov 28, 2022Updated 3 years ago
- AAAI2025☆12Apr 18, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆45Sep 12, 2024Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆45Dec 11, 2023Updated 2 years ago
- ☆42Apr 7, 2024Updated 2 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 2 months ago
- Official Implementation (PyTorch) of "Point Cloud Augmentation with Weighted Local Transformations", ICCV 2021☆41Mar 2, 2022Updated 4 years ago
- ☆13May 15, 2025Updated 10 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆56Jul 1, 2025Updated 9 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆21Aug 1, 2025Updated 8 months ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆40Mar 16, 2025Updated last year
- [ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval☆79Nov 29, 2022Updated 3 years ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 7 months ago
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 5 months ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)☆18Apr 19, 2024Updated last year
- [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset☆300Mar 14, 2024Updated 2 years ago