modelscope / OpenJudgeLinks
OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards
☆117Updated this week
Alternatives and similar repositories for OpenJudge
Users that are interested in OpenJudge are comparing it to the libraries listed below
Sorting:
- ☆404Updated 2 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆464Updated this week
- a-m-team's exploration in large language modeling☆195Updated 7 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆434Updated 4 months ago
- The related works and background techniques about Openai o1☆221Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆283Updated 2 years ago
- ☆153Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆154Updated last year
- ☆47Updated 10 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆412Updated 6 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆275Updated 10 months ago
- ☆161Updated 11 months ago
- ☆87Updated 2 years ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆300Updated 2 months ago
- ☆325Updated 7 months ago
- ☆132Updated 7 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆92Updated last month
- Evergreen, contamination-free, real-world, domain-specific AI evaluation framework☆114Updated 2 months ago
- ☆318Updated last year
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 8 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆114Updated 7 months ago
- ☆77Updated 11 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆139Updated last year
- ☆147Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆216Updated last month
- Awesome papers for role-playing with language models☆216Updated last year
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆131Updated 9 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆135Updated 6 months ago
- ☆178Updated 8 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆143Updated last month