UCSC-VLAA / ReasoningEvalLinks
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆44Updated 8 months ago
Alternatives and similar repositories for ReasoningEval
Users that are interested in ReasoningEval are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Updated 8 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated last week
- ☆43Updated 8 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆182Updated 6 months ago
- ☆52Updated last year
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆98Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆216Updated 2 months ago
- SSRL: Self-Search Reinforcement Learning