CSHaitao / Awesome-LLMs-as-Judges
The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".
☆357Updated 4 months ago
Alternatives and similar repositories for Awesome-LLMs-as-Judges
Users who are interested in Awesome-LLMs-as-Judges are comparing it to the repositories listed below.
- Controllable Text Generation for Large Language Models: A Survey☆172Updated 8 months ago
- ☆317Updated last week
- A recipe for online RLHF and online iterative DPO.☆511Updated 4 months ago
- LLM hallucination paper list☆315Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆166Updated 6 months ago
- ☆543Updated last month
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆208Updated last week
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆372Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆345Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆509Updated 3 weeks ago
- Recipes to train reward models for RLHF.☆1,330Updated 2 weeks ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆176Updated 5 months ago
- ☆100Updated last month
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".☆267Updated last week
- This is the repository for the Tool Learning survey.☆369Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆168Updated 3 weeks ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework☆213Updated last month
- Codebase for Iterative DPO Using Rule-based Rewards☆243Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆198Updated last week
- Recipes to train the self-rewarding reasoning LLMs.☆216Updated 2 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆458Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆245Updated 3 weeks ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆520Updated 6 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆208Updated 2 weeks ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆363Updated 3 months ago
- Fantastic Data Engineering for Large Language Models☆87Updated 4 months ago
- ☆155Updated 3 weeks ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 7 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆554Updated 5 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆167Updated 5 months ago