sun-hailong / TVC
π The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
β11Updated 2 weeks ago
Alternatives and similar repositories for TVC:
Users that are interested in TVC are comparing it to the libraries listed below
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigationβ71Updated 4 months ago
- HallE-Control: Controlling Object Hallucination in LMMsβ30Updated last year
- β35Updated 10 months ago
- β82Updated last month
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attentionβ32Updated 9 months ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)β45Updated 9 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Modelsβ16Updated 9 months ago
- Instruction Tuning in Continual Learning paradigmβ47Updated 3 months ago
- γNeurIPS 2024γThe official code of paper "Automated Multi-level Preference for MLLMs"β19Updated 7 months ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projectionβ24Updated last month
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.β58Updated last month
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Modelsβ55Updated 9 months ago
- β47Updated 5 months ago
- CLIP-MoE: Mixture of Experts for CLIPβ32Updated 6 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!β36Updated last month
- β16Updated 5 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Promptβ¦β39Updated 4 months ago
- β24Updated 2 months ago
- β8Updated last week
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)β46Updated 6 months ago
- Envolving Temporal Reasoning Capability into LMMs via Temporal Consistent Rewardβ34Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation inβ¦β48Updated last week
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Modelsβ73Updated 10 months ago
- Collections of Papers and Projects for Multimodal Reasoning.β104Updated 2 weeks ago
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Modelsβ59Updated this week
- Official implement of MIA-DPOβ56Updated 3 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?β28Updated 6 months ago
- β44Updated this week
- β11Updated 6 months ago
- official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"β19Updated 2 weeks ago