A recipe for online RLHF and online iterative DPO.
☆543Dec 28, 2024Updated last year
Alternatives and similar repositories for Online-RLHF
Users that are interested in Online-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recipes to train reward model for RLHF.☆1,527Apr 24, 2025Updated 11 months ago
- Directional Preference Alignment☆61Sep 23, 2024Updated last year
- RewardBench: the first evaluation tool for reward models.☆707Feb 16, 2026Updated 2 months ago
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆143Mar 23, 2023Updated 3 years ago
- kight is a static analysis tool for c/c++ programs.☆213Dec 27, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated 3 months ago
- Advanced Unsupervised Image Enhancement with GAN☆246Nov 11, 2024Updated last year
- ☆249Jul 19, 2023Updated 2 years ago
- 🤙 Control Your Mouse with Hand Gestures in the Air 🤙☆250Jun 19, 2023Updated 2 years ago
- C++ codes for FDTD Maxwell's equation.☆161Jun 11, 2023Updated 2 years ago
- An awesome list of self-sovereign identity resources.☆137Jul 9, 2024Updated last year
- A ReAct-Based Highly Robust Autonomous Agent (Harness) Framework.☆208Mar 19, 2026Updated 3 weeks ago
- An Workspace for HMI tools☆163Jul 11, 2024Updated last year
- ☆246Nov 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official implementation of Self-Play Preference Optimization (SPPO)☆586Jan 23, 2025Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- This project features optimized Go language, expert source code, concurrent processing, and industry-best practices.☆142Mar 14, 2023Updated 3 years ago
- 最终幻想14英文笔记☆96May 25, 2024Updated last year
- ☆141May 8, 2024Updated last year
- ☆286Jul 6, 2024Updated last year
- ☆120Sep 30, 2024Updated last year
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.☆143Mar 23, 2023Updated 3 years ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆147Jun 2, 2023Updated 2 years ago
- AI-powered document summarization engine that transforms lengthy texts into crystallized insights☆145Nov 5, 2024Updated last year
- ☆241Jul 5, 2024Updated last year
- AI solution for Patent Classification☆142Jun 29, 2020Updated 5 years ago
- A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆251Jan 15, 2026Updated 3 months ago
- Book Recommendation System☆234May 2, 2024Updated last year
- ☆153Jul 28, 2022Updated 3 years ago
- ☆142Nov 13, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Visualization, simulation, manipulation of Intrinsically disorder proteins with Gibbs sampling☆288Oct 24, 2024Updated last year
- An extension for Visual Studio Code that integrates the power of OpenAI's GPT models into VSCode.☆160Mar 24, 2024Updated 2 years ago
- 🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl…☆102Oct 5, 2023Updated 2 years ago
- 一个轻量级Java RPC 框架, 底层采用Netty实现, 模拟Dubbo运行模式(闲来无事 练习一下)☆66May 30, 2023Updated 2 years ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆306Aug 18, 2024Updated last year
- Inscriptions on CoreDao, powered by Insdexer.☆147Mar 20, 2024Updated 2 years ago
- 一个轻量的企业级BFF框架,集成xprofiler能力,可直接使用其强大的监控告警能力。☆264Feb 7, 2024Updated 2 years ago