Lego for GRPO
☆30May 27, 2025Updated 9 months ago
Alternatives and similar repositories for reward-composer
Users that are interested in reward-composer are comparing it to the libraries listed below
Sorting:
- ☆16Feb 22, 2025Updated last year
- ☆37Aug 4, 2025Updated 7 months ago
- This unique variation on Thinking Claude maps Claude's thought process steps to unicode and forces Claude to think in unicode, potentiall…☆16Feb 24, 2025Updated last year
- ☆12Mar 23, 2025Updated 11 months ago
- RLVR Testing and Training☆23Aug 28, 2025Updated 6 months ago
- Train your own SOTA deductive reasoning model☆107Mar 6, 2025Updated last year
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn☆21Apr 29, 2025Updated 10 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- Web3 Infrastructure for gardening, growing, selling and earning crypto from your flowers.☆12Jan 28, 2026Updated last month
- Train transformer language models with reinforcement learning.☆19Feb 25, 2025Updated last year
- An AI Agent framework in Go for building Agents with RAG, Knowledge, Memory, Tools☆22May 16, 2025Updated 9 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Mar 2, 2026Updated last week
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- A comprehensive React Native starter template built with Expo. It includes reusable UI components, Poppins font setup, NativeWind, Fireba…☆23Updated this week
- Neurosity EEG Dataset repository☆29Apr 8, 2024Updated last year
- A simple lightweight Model Context Protocol (MCP) server integration framework☆17Jan 23, 2026Updated last month
- Build use cases with VideoDB☆31Feb 12, 2026Updated 3 weeks ago
- Gradio UI for a Cog API☆70Apr 8, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- ☆37Mar 24, 2025Updated 11 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 10 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- AuraMatrix is personality analysis web which using llm to do evaluation. I have made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated 2 months ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Mar 18, 2025Updated 11 months ago
- Concatenated documentation for use with LLMs☆54Feb 26, 2026Updated last week
- This is the repository for brain state prediction using fMRI data and transformer.☆81Jul 22, 2024Updated last year
- Glitch Gremlin AI☆15Apr 5, 2025Updated 11 months ago
- For comparing different tunes differences from WinOLS!☆11Dec 8, 2020Updated 5 years ago
- Resilient Virtual Machine Monitor is a complete fault tolerance solution for type-I hypervisors adopting one of the most popular VMM arch…☆11Jul 30, 2020Updated 5 years ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…☆28May 17, 2025Updated 9 months ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you.☆23Nov 20, 2025Updated 3 months ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆97May 16, 2025Updated 9 months ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Orca is a workspace for vibe coding built upon the principals of tracking what the agent changes and only keeping what you want☆49Updated this week
- React Native, Right Now (rn-rn)☆18Sep 2, 2025Updated 6 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Feb 13, 2026Updated 3 weeks ago