Reinforcing General Reasoning without Verifiers
☆100Jun 24, 2025Updated 11 months ago
Alternatives and similar repositories for VeriFree
Users that are interested in VeriFree are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆55Jul 15, 2025Updated 11 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 2 months ago
- Code for "Variational Reasoning for Language Models"☆60Sep 29, 2025Updated 8 months ago
- ☆56Oct 23, 2023Updated 2 years ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- ☆47Jun 24, 2025Updated 11 months ago
- ☆24May 20, 2025Updated last year
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- ☆14Oct 28, 2023Updated 2 years ago
- ☆28Jul 18, 2025Updated 10 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 10 months ago
- ☆80Jun 8, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs☆73Mar 21, 2025Updated last year
- ☆10May 21, 2026Updated 3 weeks ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling