kubernetes-bad / reward-composerView external linksLinks
Lego for GRPO
☆30May 27, 2025Updated 8 months ago
Alternatives and similar repositories for reward-composer
Users that are interested in reward-composer are comparing it to the libraries listed below
Sorting:
- ☆16Feb 22, 2025Updated 11 months ago
- ☆37Aug 4, 2025Updated 6 months ago
- This unique variation on Thinking Claude maps Claude's thought process steps to unicode and forces Claude to think in unicode, potentiall…☆15Feb 24, 2025Updated 11 months ago
- ☆12Mar 23, 2025Updated 10 months ago
- RLVR Testing and Training☆23Aug 28, 2025Updated 5 months ago
- Train your own SOTA deductive reasoning model☆107Mar 6, 2025Updated 11 months ago
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn☆20Apr 29, 2025Updated 9 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆27Mar 1, 2025Updated 11 months ago
- An AI Agent framework in Go for building Agents with RAG, Knowledge, Memory, Tools☆22May 16, 2025Updated 9 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated this week
- Neurosity EEG Dataset repository☆29Apr 8, 2024Updated last year
- A simple lightweight Model Context Protocol (MCP) server integration framework☆17Jan 23, 2026Updated 3 weeks ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- Build use cases with VideoDB☆30Updated this week