VILA-Lab / DELTLinks
(CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA top 1-acc by +1.3% and increases diversity per class by +5%
☆23Updated 2 months ago
Alternatives and similar repositories for DELT
Users that are interested in DELT are comparing it to the libraries listed below
Sorting:
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated last month
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆49Updated last month
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆78Updated last week
- Data distillation benchmark☆66Updated last month
- [Preprint 2025] Thinkless: LLM Learns When to Think☆201Updated 3 weeks ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆93Updated last week
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆59Updated 3 months ago
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆136Updated last year
- Code for Heima☆50Updated 2 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆47Updated 2 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated last year
- Matryoshka Multimodal Models☆111Updated 5 months ago
- ☆86Updated last month
- [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs☆147Updated 11 months ago
- ☆27Updated last year
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆47Updated last week
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆46Updated 8 months ago
- Main source code of SRPO framework.☆29Updated 3 weeks ago
- ☆53Updated 2 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆144Updated last week
- Elucidated Dataset Condensation (NeurIPS 2024)☆21Updated 9 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆20Updated 8 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆21Updated 4 months ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆206Updated 6 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆78Updated last month
- ☆88Updated last month
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆105Updated last week
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 7 months ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆63Updated last month
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆13Updated 3 months ago