RewardAnything: Generalizable Principle-Following Reward Models
☆45Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for RewardAnything
Users who are interested in RewardAnything are comparing it to the libraries listed below
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆33Jun 3, 2025Updated 8 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆20Sep 22, 2025Updated 5 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- A curated registry of free 3D VRM avatars for games, VR, and the metaverse☆40Jan 27, 2026Updated last month
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Aug 18, 2024Updated last year
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆64Feb 6, 2026Updated 3 weeks ago
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 8 months ago
- How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning☆26Aug 29, 2024Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated last week
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆30Oct 27, 2025Updated 4 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆52Oct 23, 2025Updated 4 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 8 months ago
- ☆352Jul 29, 2025Updated 6 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- TOON as DSPy adapter☆25Feb 1, 2026Updated 3 weeks ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Nov 4, 2025Updated 3 months ago
- Code Repository for Linux Troubleshooting Course with Real Life Examples, published by Packt☆12Jul 8, 2025Updated 7 months ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- ☆17Jun 8, 2025Updated 8 months ago
- Open Source Avatars Gallery website featuring 4260+ free CC0 avatars for Vtubing VR, games and metaverse. Built with Next.js and ArDrive …☆16Jan 27, 2026Updated last month
- ☆14Mar 21, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- This JavaScript CLI "undeletes" packages that have been removed from the NPM registry☆29Dec 18, 2025Updated 2 months ago
- ☆11Dec 23, 2023Updated 2 years ago
- This is a PoC using the native Windows DirectX API to hide and decrypt shellcode via a compute shader☆10May 3, 2025Updated 9 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- code for polite☆11Feb 28, 2024Updated 2 years ago
- ☆16Jun 25, 2025Updated 8 months ago
- Paddle reproduction of the paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information" (ACL 2021)☆10Nov 15, 2021Updated 4 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Llama2 chinese finetuning☆38Aug 2, 2023Updated 2 years ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆50Feb 4, 2026Updated 3 weeks ago
- ReLAx - Reinforcement Learning Applications Library☆15Feb 19, 2023Updated 3 years ago