kokolerk / R1-V-GUI-agent

☆15

Alternatives and similar repositories for R1-V-GUI-agent:

Users that are interested in R1-V-GUI-agent are comparing it to the libraries listed below

OpenKG-ORG / EasyDetect
An Easy-to-use Hallucination Detection Framework for LLMs.
☆58Updated last year
MJ-Bench / MJ-Bench
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
☆43Updated 2 months ago
RainBowLuoCS / DEEM
(ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆34Updated last month
SihengLi99 / LLM-Honesty-Survey
A Survey on the Honesty of Large Language Models
☆57Updated 5 months ago
ShuheSH / A-Survey-of-the-Reasoning-Abilities-of-LLMs
☆21Updated 2 months ago
VLKEB / VLKEB
☆53Updated 6 months ago
ChnQ / TracingLLM
☆25Updated 11 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆67Updated 2 months ago
cathyxl / MAgIC
☆40Updated 5 months ago
gzcch / Bingo
☆54Updated last year
haonan3 / V1
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆34Updated 3 weeks ago
1zhou-Wang / MemVR
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆48Updated last week
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆57Updated last year
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆72Updated 6 months ago
JLZhong23 / awesome-reward-models
☆26Updated last week
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆58Updated 4 months ago
zhxieml / remiss-jailbreak
☆28Updated 10 months ago
bethgelab / sober-reasoning
Code for "A Sober Look at Progress in Language Model Reasoning" paper
☆41Updated 3 weeks ago
maitrix-org / de-arena
Official repository for Decentralized Arena via Collective LLM Intelligence
☆10Updated 6 months ago
LiuAmber / RAHF
☆22Updated 7 months ago
yuezih / less-is-more
Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)
☆46Updated 6 months ago
lfy79001 / S3Eval
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆32Updated 10 months ago
clemneo / llava-interp
☆53Updated 6 months ago
junyangwang0410 / AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
☆117Updated last year
luka-group / vlm-knowledge-conflict
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆42Updated 6 months ago
rookie-joe / AutoPSV
☆45Updated 6 months ago
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆73Updated 5 months ago
limenlp / verl
AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
☆29Updated 3 weeks ago
zitian-gao / SC-MCTS
Interpretable Contrastive Monte Carlo Tree Search Reasoning
☆48Updated 5 months ago
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆46Updated 4 months ago