banjiuyufen / RecoverableCompressionLinks
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
☆22Updated 9 months ago
Alternatives and similar repositories for RecoverableCompression
Users that are interested in RecoverableCompression are comparing it to the libraries listed below
Sorting:
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆137Updated 3 weeks ago
- Unified the Anonymous and Camera Ready Version, hope everyone can get an ACCEPT☆249Updated 7 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆29Updated last month
- ☆41Updated 10 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆340Updated 9 months ago
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs☆32Updated 4 months ago
- A Comprehensive Survey on Continual Learning in Generative Models.☆116Updated last week
- A paper list of Awesome Latent Space.☆333Updated this week
- An example reproduction checklist for AAAI-26 submissions.☆103Updated 6 months ago
- MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark☆60Updated this week
- Official repository for VisionZip (CVPR 2025)☆403Updated 6 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆423Updated last year
- ☆56Updated last year
- Official implementation of MC-LLaVA.☆140Updated 2 months ago
- 历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.☆549Updated 10 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆60Updated last year
- a brief repo about paper research☆15Updated last year
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆106Updated last month
- Provide .bst files for NeurIPS latex template☆48Updated 9 months ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆262Updated 4 months ago
- 关于LLM和Multimodal LLM的paper list☆56Updated 3 weeks ago
- Analyze top AI conference papers to discover research hotspots and trends using topic modeling.☆129Updated last month
- Towards Efficient Multimodal Large Language Models: A Survey on Token Compression☆98Updated 3 weeks ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆35Updated last year
- 在没有sudo权限的情况下,在linux上使用clash☆173Updated last year
- OOD Generalization相关文章的阅读笔记☆35Updated last year
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆46Updated 4 months ago
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198☆290Updated this week
- Visualizing the attention of vision-language models☆279Updated 11 months ago
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient☆44Updated 9 months ago