Recipes for training reward models for RLHF.
☆1,515 Apr 24, 2025 Updated 10 months ago
Alternatives and similar repositories for RLHF-Reward-Modeling
Users who are interested in RLHF-Reward-Modeling are comparing it to the libraries listed below.
- A recipe for online RLHF and online iterative DPO. ☆539 Dec 28, 2024 Updated last year
- kight is a static analysis tool for C/C++ programs. ☆214 Dec 27, 2024 Updated last year
- ☆247 Nov 24, 2024 Updated last year
- A workspace for HMI tools ☆164 Jul 11, 2024 Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation ☆155 Oct 18, 2024 Updated last year
- Advanced Unsupervised Image Enhancement with GAN ☆247 Nov 11, 2024 Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024]. ☆274 Dec 3, 2024 Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games ☆143 Mar 23, 2023 Updated 2 years ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardware (e.g., GPU) to efficiently support the exec… ☆254 Jan 7, 2026 Updated last month
- A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classificati… ☆252 Jan 15, 2026 Updated last month
- ☆288 Jul 6, 2024 Updated last year
- A code repository designed to show the best GitHub has to offer. ☆165 Jun 30, 2024 Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098… ☆318 Jul 31, 2025 Updated 7 months ago
- ☆142 Nov 13, 2024 Updated last year
- A lightweight, enterprise-grade BFF framework with integrated xprofiler support, so its powerful monitoring and alerting capabilities can be used directly. ☆265 Feb 7, 2024 Updated 2 years ago
- C++ code for solving Maxwell's equations with FDTD. ☆161 Jun 11, 2023 Updated 2 years ago
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx… ☆156 May 25, 2024 Updated last year
- ☆242 Jul 5, 2024 Updated last year
- ☆142 May 8, 2024 Updated last year