"A Survey on Agent-as-a-Judge"
☆110Jan 12, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-Agent-as-a-Judge
Users that are interested in Awesome-Agent-as-a-Judge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jan 14, 2026Updated 2 months ago
- ☆27Mar 10, 2026Updated 2 weeks ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆60Jan 28, 2026Updated last month
- ☆21Aug 9, 2024Updated last year
- ☆18Jun 24, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆65Feb 21, 2025Updated last year
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- [WWW'25 Oral] Large Language Models Empowered Personalized Web Agents.☆20Nov 11, 2025Updated 4 months ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆46Jan 6, 2026Updated 2 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆66Feb 27, 2026Updated last month
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 3 weeks ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆71Sep 13, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks☆79Oct 11, 2025Updated 5 months ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆20Updated this week
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Official Project Page for Web World Models (https://arxiv.org/abs/2512.23676)☆89Jan 30, 2026Updated last month
- ☆33Apr 11, 2025Updated 11 months ago
- Official Implementation for NorMuon paper☆62Mar 11, 2026Updated 2 weeks ago
- ☆14Apr 30, 2025Updated 10 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- ☆20Oct 13, 2020Updated 5 years ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".☆143Updated this week
- Connect VS Code to Google Colab Runtimes☆35Nov 13, 2025Updated 4 months ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated 11 months ago
- ☆11Jul 11, 2023Updated 2 years ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…☆24May 2, 2025Updated 10 months ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆49Mar 16, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆29Aug 25, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- A buyers checklist guide for purchasing your new BYD Car☆10Feb 18, 2024Updated 2 years ago
- A Comprehensive Benchmark of Imbalanced Graph Learning (Accepted by ICLR 2025 Spotlight)☆12Apr 17, 2025Updated 11 months ago
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning☆28Jun 12, 2025Updated 9 months ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 8 months ago
- CycleGAN adaptation for day-to-night domain transfer of driving-related scenes.☆14Apr 9, 2019Updated 6 years ago