RLHFlow/Reinforce-Ada

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/Reinforce-Ada)

RLHFlow / Reinforce-Ada

[COLM 2026] An adaptive sampling framework for Reinforce-style LLM post training.

☆96

Alternatives and similar repositories for Reinforce-Ada

Users that are interested in Reinforce-Ada are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 8 months ago
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
MarkLee131 / PoC-Research-Papers
View on GitHub
Research papers on Proot-of-Concepts
☆114Feb 3, 2026Updated 5 months ago
Tanglumy / Finance-Bro
View on GitHub
your finance bro Agent for trading and investing
☆111Nov 8, 2025Updated 8 months ago
gulucaptain / DynamiCtrl
View on GitHub
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142May 23, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ECNU-SII / Continual-NExT
View on GitHub
☆235Jun 27, 2026Updated 3 weeks ago
ant-research / AvatarArtist
View on GitHub
[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
☆280Jun 14, 2025Updated last year
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 9 months ago
aoda-zhang / PawHaven-FullStack-React-NodeJS
View on GitHub
🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…
☆90Updated this week
Jinxhy / THEMIS
View on GitHub
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108Aug 13, 2025Updated 11 months ago
wguo-ai / SSV2A
View on GitHub
Gotta Hear Them All: Towards Sound Source Aware Audio Generation.
☆69Nov 15, 2025Updated 8 months ago
MarkLee131 / Hypervisor-Testing-Survey
View on GitHub
A collection of research papers on hypervisor testing.
☆65May 21, 2026Updated 2 months ago
AIR-DISCOVER / FreeAskWorld
View on GitHub
[AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…
☆228Jul 3, 2026Updated 2 weeks ago
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313Jan 27, 2026Updated 5 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Tsinghua-dhy / UR2
View on GitHub
UR2: Unify RAG and Reasoning through Reinforcement Learning
☆131May 26, 2026Updated last month
tangpan360 / MicroRCA-Agent
View on GitHub
2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…
☆256Jan 14, 2026Updated 6 months ago
serendipity800 / open-motion-apis
View on GitHub
☆80Mar 5, 2026Updated 4 months ago
yunbeizhang / Awesome-Visual-Prompt-Tuning
View on GitHub
[TMLR] A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).
☆115Feb 22, 2026Updated 5 months ago
Pony-Unicorn / tiny-client-only
View on GitHub
A lightweight React component that renders its children only on the client side, helping avoid SSR hydration errors in frameworks like Ne…
☆31Jul 10, 2026Updated last week
bcmi / Object-Reflection-Generation-Dataset-DEROBA
View on GitHub
The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.
☆58Apr 4, 2026Updated 3 months ago
JingyuanXu / ucfaceconbainall
View on GitHub
Unified Semantic Curation Face (USCFace): An RDF Curation & Visualization System
☆38Jul 18, 2025Updated last year
EDAPINENUT / ExplicitShortCut
View on GitHub
Official implementation of the paper <On the Design of One-Step Diffusion via Shortcutting Flow Paths>
☆286Apr 1, 2026Updated 3 months ago
damo-cv / JCo-MVTON
View on GitHub
☆124Aug 29, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jiaweizzhao / InRank
View on GitHub
☆153Jan 2, 2024Updated 2 years ago
YOUNG-bit / OpenGS-Fusion
View on GitHub
[IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding
☆76Aug 2, 2025Updated 11 months ago
Harrydirk41 / ProTDyn
View on GitHub
Generative Protein Emulator
☆69Sep 25, 2025Updated 9 months ago
HKUDS / LightReasoner
View on GitHub
[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
☆602May 22, 2026Updated 2 months ago
kand-ta / kand
View on GitHub
Kand: Blazing-Fast, Modern Technical Analysis in Rust, Python, and WASM.
☆564Jan 22, 2026Updated 6 months ago
bcmi / OSInsert-Image-Composition
View on GitHub
☆62Jun 28, 2026Updated 3 weeks ago
ByteDance-Seed / DAComp
View on GitHub
[ICLR 2026] DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
☆432Jul 10, 2026Updated last week
Victor20082018 / -Optimized-Aquatic-Target-Recognition-Model
View on GitHub
The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…
☆47May 15, 2025Updated last year
Uderwood-TZ / LSTM-PINN-and-PINN-for-population-forecasting
View on GitHub
LSTM-PINN and PINN for population forecasting
☆39May 9, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
THUDM / INFTY
View on GitHub
INFTY Engine: An Optimization Toolkit to Support Continual AI
☆573Jun 8, 2026Updated last month
Ma-Zhuang / OmniNWM
View on GitHub
[ECCV 2026] OmniNWM: Omniscient Navigation World Models for Autonomous Driving
☆364Jun 18, 2026Updated last month
curryqka / AgentThink
View on GitHub
[EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!
☆147Sep 27, 2025Updated 9 months ago
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
Ga-Lee / Frequency-aware-Length-EXtension
View on GitHub
official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"
☆117Feb 17, 2026Updated 5 months ago
fscdc / Oracle-Pruning-Sanity-Check
View on GitHub
[TMLR 2026] Is Oracle Pruning the True Oracle?
☆34Jul 1, 2026Updated 3 weeks ago
WWIIITT / 360-degree-video-super-resolution
View on GitHub
Leveraging AI, this solution boosts 360° video quality through 4x upscaling with Real-ESRGAN. It integrates GFPGAN for smart face enhance…
☆25Jun 27, 2025Updated last year