"A Survey on Agent-as-a-Judge"
☆119 · Jan 12, 2026 · Updated 3 months ago
Alternatives and similar repositories for Awesome-Agent-as-a-Judge
Users who are interested in Awesome-Agent-as-a-Judge are comparing it to the repositories listed below.
- ☆13 · Jan 14, 2026 · Updated 3 months ago
- ☆27 · Mar 10, 2026 · Updated last month
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning" ☆62 · Jan 28, 2026 · Updated 2 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆66 · Feb 21, 2025 · Updated last year
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan… ☆14 · Mar 30, 2026 · Updated 3 weeks ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling ☆13 · May 5, 2025 · Updated 11 months ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258) ☆48 · Jan 6, 2026 · Updated 3 months ago
- "AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation" ☆42 · Jan 27, 2026 · Updated 2 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 6 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui… ☆71 · Feb 27, 2026 · Updated last month
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆78 · Sep 13, 2025 · Updated 7 months ago
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing ☆13 · Feb 9, 2025 · Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆65 · Dec 10, 2025 · Updated 4 months ago
- Training tiny models to prove hard theorems ☆72 · Mar 5, 2026 · Updated last month
- MLX Implementation of Recursive Reasoning with Tiny Networks ☆78 · Oct 11, 2025 · Updated 6 months ago
- Evaluation code of ASE24 accepted paper "On the Evaluation of LLM in Unit Test Generation" ☆13 · Dec 9, 2024 · Updated last year
- ☆12 · Sep 12, 2024 · Updated last year
- Simple and Ideal Circuit Simulation ☆13 · Dec 4, 2017 · Updated 8 years ago
- ☆34 · Apr 11, 2025 · Updated last year
- Official Implementation for NorMuon paper ☆65 · Mar 11, 2026 · Updated last month
- ☆13 · May 21, 2024 · Updated last year
- ☆15 · Apr 30, 2025 · Updated 11 months ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations" ☆13 · May 8, 2023 · Updated 2 years ago
- A project for tri-modal LLM benchmarking and instruction tuning. ☆58 · Mar 27, 2025 · Updated last year
- ☆20 · Oct 13, 2020 · Updated 5 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1 ☆13 · Apr 23, 2025 · Updated 11 months ago
- Connect VS Code to Google Colab Runtimes ☆37 · Nov 13, 2025 · Updated 5 months ago
- ☆13 · Nov 20, 2024 · Updated last year
- ☆11 · Jul 11, 2023 · Updated 2 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 9 months ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge… ☆25 · May 2, 2025 · Updated 11 months ago
- ☆29 · Aug 25, 2024 · Updated last year
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj… ☆37 · Feb 20, 2026 · Updated 2 months ago
- Transfer Learning for Stenosis Detection in X-ray Coronary Angiography ☆13 · Jul 3, 2021 · Updated 4 years ago
- ☆14 · Mar 26, 2024 · Updated 2 years ago
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning ☆28 · Jun 12, 2025 · Updated 10 months ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 9 months ago
- ☆11 · Jun 21, 2025 · Updated 9 months ago
- ☆14 · Nov 19, 2024 · Updated last year