taco-group / Re-Align
A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.
☆41Updated 2 weeks ago
Alternatives and similar repositories for Re-Align:
Users that are interested in Re-Align are comparing it to the libraries listed below
- AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public safety by ens…☆45Updated 4 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆87Updated 6 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆28Updated 6 months ago
- ☆82Updated last month
- ☆40Updated 4 months ago
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing☆37Updated 4 months ago
- Prioritize Alignment in Dataset Distillation☆20Updated 5 months ago
- ☆16Updated 5 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆50Updated last month
- official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"☆19Updated 2 weeks ago
- ☆53Updated 6 months ago
- LEO: A powerful Hybrid Multimodal LLM☆18Updated 3 months ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆21Updated last month
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆44Updated 5 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆99Updated 2 weeks ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆17Updated 3 weeks ago
- Official implement of MIA-DPO☆56Updated 3 months ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆19Updated 2 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆83Updated last year
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆30Updated last year
- ☆23Updated 11 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆37Updated 6 months ago
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆32Updated 2 weeks ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆73Updated 10 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆52Updated 5 months ago
- ☆75Updated 4 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆39Updated 4 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 2 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆39Updated 2 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆36Updated last year