EchoseChen / SPA-VL-RLHF
The reinforcement learning codes for dataset SPA-VL
β21Updated 5 months ago
Related projects β
Alternatives and complementary repositories for SPA-VL-RLHF
- π curated list of awesome LMM hallucinations papers, methods & resources.β146Updated 8 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigatingβ79Updated 9 months ago
- A Survey on the Honesty of Large Language Modelsβ47Updated last month
- β25Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learningβ153Updated 9 months ago
- β72Updated 10 months ago
- β22Updated last month
- β39Updated 5 months ago
- β13Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"β60Updated 8 months ago
- A RLHF Infrastructure for Vision-Language Modelsβ111Updated last week
- my commonly-used toolsβ47Updated 3 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language β¦β25Updated last month
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decodingβ215Updated last month
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ53Updated 4 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets andβ¦β29Updated last month
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β182Updated 7 months ago
- Accepted by ECCV 2024β74Updated last month
- β116Updated 4 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β100Updated 3 weeks ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuningβ33Updated 9 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(β¦β248Updated last week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.β34Updated 2 weeks ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".β43Updated 2 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.β48Updated 7 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluationβ93Updated 10 months ago
- Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"β163Updated 2 months ago
- γACL 2024γ SALAD benchmark & MD-Judgeβ106Updated last month
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimizationβ¦β13Updated 9 months ago
- β54Updated 2 months ago