ModalityDance/Awesome-Agent-as-a-Judge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ModalityDance/Awesome-Agent-as-a-Judge)

ModalityDance / Awesome-Agent-as-a-Judge

"A Survey on Agent-as-a-Judge"

☆138

Alternatives and similar repositories for Awesome-Agent-as-a-Judge

Users that are interested in Awesome-Agent-as-a-Judge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ModalityDance / LatentTTS
View on GitHub
"Parallel Test-Time Scaling for Latent Reasoning Models"
☆22Apr 12, 2026Updated 3 months ago
ModalityDance / MRM
View on GitHub
[SIGIR 2026] "One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment"
☆15Apr 21, 2026Updated 3 months ago
ModalityDance / Omni-R1
View on GitHub
[ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"
☆63May 26, 2026Updated last month
ModalityDance / AR-Omni
View on GitHub
"AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation"
☆43May 26, 2026Updated last month
hemingkx / Whisper
View on GitHub
[ACL 2026] Enabling Efficient Reasoning in LLMs via Black-box Persuasive Prompting
☆22Jan 9, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
liyongqi67 / LTRGR
View on GitHub
☆21Aug 9, 2024Updated last year
WangHanLinHenry / STeCa
View on GitHub
(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"
☆29Mar 2, 2026Updated 4 months ago
wangjs9 / Muffin
View on GitHub
Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)
☆17Jul 2, 2024Updated 2 years ago
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆33Apr 12, 2025Updated last year
cooperleong00 / ToxificationReversal
View on GitHub
Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
☆18Oct 17, 2023Updated 2 years ago
hemingkx / SWIFT
View on GitHub
[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
☆70Feb 21, 2025Updated last year
liyongqi67 / GRACE
View on GitHub
☆29Aug 25, 2024Updated last year
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
WangHanLinHenry / SPA-RL-Agent
View on GitHub
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆89Sep 13, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
caiqizh / LUQ
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
wjhou / Radar
View on GitHub
[ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection
☆34Jul 23, 2025Updated 11 months ago
liyongqi67 / GCoQA
View on GitHub
☆18Jun 24, 2025Updated last year
wangjs9 / Aligned-dPM
View on GitHub
PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach
☆32Nov 6, 2023Updated 2 years ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
wangjs9 / CARE-master
View on GitHub
PyTorch implementation of CARE
☆16Oct 6, 2023Updated 2 years ago
Yui010206 / Adaptive-Visual-Imagination-Control
View on GitHub
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18Jun 2, 2026Updated last month
yczhou001 / PF-OPSD
View on GitHub
World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning
☆21Jun 3, 2026Updated last month
yczhou001 / Awesome-Medical-LLM-Agent
View on GitHub
Reasoning as the Engine: The Evolution from Medical LLMs to Versatile Medical Agents
☆35Jan 27, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YiyangZhou / CSR
View on GitHub
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆87Oct 26, 2025Updated 8 months ago
hemingkx / TokenSkip
View on GitHub
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆224Nov 30, 2025Updated 7 months ago
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated last year
iwangjian / Midi-Tuning
View on GitHub
[ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
☆26Oct 18, 2025Updated 9 months ago
wjhou / ORGan
View on GitHub
[ACL 2023] ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning
☆55Oct 3, 2024Updated last year
EIT-NLP / Distilling-CoT-Reasoning
View on GitHub
[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆22Feb 26, 2025Updated last year
UMass-Embodied-AGI / CHAIC
View on GitHub
[NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…
☆25May 2, 2025Updated last year
loyiv / ITP
View on GitHub
Code of Paper: Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
☆16Mar 17, 2026Updated 4 months ago
iwangjian / Coding-Tutor
View on GitHub
[ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
☆90Jun 2, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wjhou / ICon
View on GitHub
[EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation
☆19Dec 11, 2024Updated last year
xzxxntxdy / PEPO
View on GitHub
Official repo for ”Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought“
☆26Mar 29, 2026Updated 3 months ago
iwangjian / TopDial
View on GitHub
[EMNLP 2023] Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation
☆31Oct 18, 2025Updated 9 months ago
ChangyuChen347 / MaskedThought
View on GitHub
[ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
☆27Jul 9, 2024Updated 2 years ago
llm-as-a-judge / Awesome-LLM-as-a-judge
View on GitHub
☆566May 21, 2026Updated 2 months ago
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆14Oct 18, 2025Updated 9 months ago
iwangjian / Color4Dial
View on GitHub
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)
☆21Nov 10, 2025Updated 8 months ago