PKU-YuanGroup / Peer-review-in-LLMs

Peer-review-in-LLMs: Automatic Evaluation Method for LLMs in Open-environment，https://arxiv.org/pdf/2402.01830.pdf

☆33

Alternatives and similar repositories for Peer-review-in-LLMs:

Users that are interested in Peer-review-in-LLMs are comparing it to the libraries listed below

PKU-YuanGroup / GPT-as-Language-Tree
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective
☆41Updated 3 weeks ago
pipilurj / bootstrapped-preference-optimization-BPO
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆54Updated 5 months ago
OpenGVLab / MMIU
[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
☆61Updated 5 months ago
yihedeng9 / STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆63Updated 8 months ago
YiyangZhou / CSR
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆67Updated 8 months ago
MAmmoTH-VL / MAmmoTH-VL
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆32Updated 2 months ago
yu-rp / apiprompting
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
☆63Updated 4 months ago
RainBowLuoCS / DEEM
DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. (ICLR2025 Spotlight)
☆18Updated 2 months ago
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆47Updated this week
sail-sg / MMCBench
☆27Updated last year
PKU-YuanGroup / N-LoRA
【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".
☆31Updated 2 months ago
TencentARC / GVT
Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
☆58Updated last year
Qinyu-Allen-Zhao / LVLM-LP
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
☆24Updated 3 months ago
VIStA-H / GPT-4V_Social_Media
GPT-4V(ision) as A Social Media Analysis Engine
☆31Updated last month
OpenGVLab / Multitask-Model-Selector
[NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector
☆36Updated 11 months ago
yfzhang114 / LLaVA-Align
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…
☆76Updated 10 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆48Updated 3 months ago
si0wang / VisVM
☆33Updated last month
yaolinli / DeCo
☆28Updated 7 months ago
OpenGVLab / V2PE
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆28Updated 2 months ago
AoiDragon / POPE
[EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
☆78Updated 10 months ago
zwq2018 / Multi-modal-Self-instruct
The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…
☆71Updated 2 weeks ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆62Updated 2 months ago
locuslab / llava-token-compression
☆37Updated 3 months ago
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆62Updated 3 months ago
BAAI-DCAI / Multimodal-Robustness-Benchmark
☆42Updated last month
Dongping-Chen / ISG
(ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.
☆16Updated last week
PKU-YuanGroup / Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
☆125Updated last year
Share14 / ShareGemini
☆26Updated 6 months ago
Cooperx521 / PyramidDrop
The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".
☆51Updated last month