yuz1wan / video_distillationLinks
Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.
☆30Updated last year
Alternatives and similar repositories for video_distillation
Users that are interested in video_distillation are comparing it to the libraries listed below
Sorting:
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆25Updated last year
- Code for our ICML'24 on multimodal dataset distillation☆41Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆16Updated 5 months ago
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆98Updated last year
- ☆16Updated last year
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆78Updated last year
- official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning"☆52Updated last year
- ☆11Updated 4 months ago
- A pytorch implementation of CVPR24 paper "D4M: Dataset Distillation via Disentangled Diffusion Model"☆36Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆131Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆95Updated last month
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- ☆44Updated last year
- Distilling Dataset into Generative Models☆54Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆40Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆46Updated 11 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆37Updated last year
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆28Updated last year
- ☆113Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Updated 10 months ago
- [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆45Updated last month
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆104Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆73Updated 2 years ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆20Updated last year
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆31Updated last year
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆55Updated 7 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆80Updated last year
- ☆60Updated 11 months ago
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆27Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 3 years ago