Zyq-scut/RLTF

Accepted by Transactions on Machine Learning Research (TMLR)

☆135

Alternatives and similar repositories for RLTF

Users who are interested in RLTF compare it to the repositories listed below.

reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆116 · Jan 9, 2024 · Updated 2 years ago
salesforce / CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…
☆573 · Jun 2, 2026 · Updated last month
Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆73 · Aug 31, 2024 · Updated last year
bigcode-project / octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
☆479 · Feb 5, 2025 · Updated last year
nickrosh / evol-teacher
Open Source WizardCoder Dataset
☆166 · Jul 12, 2023 · Updated 3 years ago
theblackcat102 / evol-dataset
evol augment any dataset online
☆61 · Aug 3, 2023 · Updated 2 years ago
amazon-science / llm-code-preference
Training and Benchmarking LLMs for Code Preference.
☆38 · Nov 15, 2024 · Updated last year
GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆806 · Sep 22, 2023 · Updated 2 years ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆76 · Jun 25, 2024 · Updated 2 years ago
loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
☆196 · Apr 11, 2023 · Updated 3 years ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆63 · Apr 10, 2024 · Updated 2 years ago
hendrycks / apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
☆534 · Jun 19, 2024 · Updated 2 years ago
iSEngLab / LLM4UT_Empirical
[ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing
☆13 · Feb 9, 2025 · Updated last year
awsm-research / pytester
☆13 · Nov 20, 2024 · Updated last year
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆43 · Nov 9, 2023 · Updated 2 years ago
LAMDA-NeSy / Self-Backtracking
☆52 · Feb 12, 2025 · Updated last year
bigcode-project / selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆323 · Feb 24, 2025 · Updated last year
nyu-mll / ILF-for-code-generation
☆81 · Mar 24, 2025 · Updated last year
NL2Code / CodeM
☆44 · Jun 2, 2024 · Updated 2 years ago
hanningzhang / ER-PRM
☆20 · Dec 14, 2024 · Updated last year
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32 · Oct 12, 2023 · Updated 2 years ago
shunzh / Code-AI-Tree-Search
☆118 · Jul 17, 2024 · Updated 2 years ago
awsm-research / VulRepair
VulRepair: A T5-Based Automated Software Vulnerability Repair
☆84 · May 13, 2025 · Updated last year
zarakiquemparte / zaraki-tools
☆28 · Aug 30, 2023 · Updated 2 years ago
microsoft / CodeT
☆678 · Nov 1, 2024 · Updated last year
bigcode-project / bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
☆1,052 · Jul 22, 2025 · Updated 11 months ago
ArmelRandy / Self-instruct
A repository to perform self-instruct with a model on HF Hub
☆32 · Sep 29, 2023 · Updated 2 years ago
RUCAIBox / SWE-World
☆49 · Mar 6, 2026 · Updated 4 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 · Jul 12, 2023 · Updated 3 years ago
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆170 · Sep 18, 2025 · Updated 10 months ago
bprabhakar / upside-down-reinforcement-learning
PyTorch-based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.
☆12 · May 1, 2020 · Updated 6 years ago
shuyanzhou / docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
☆253 · Dec 15, 2023 · Updated 2 years ago
luyug / MORES
☆10 · Apr 16, 2021 · Updated 5 years ago
callummcdougall / sae-exercises-mats
☆26 · Dec 20, 2023 · Updated 2 years ago
niansong1996 / lever
Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23)
☆90 · Jul 5, 2023 · Updated 3 years ago
yeahrmek / pylean
Python wrapper for lean-gym
☆13 · Apr 5, 2023 · Updated 3 years ago
RaoNikitha / CAT-LM
☆15 · Feb 28, 2024 · Updated 2 years ago
sahil280114 / codealpaca
☆1,513 · May 12, 2023 · Updated 3 years ago
microsoft / toga
☆33 · Jun 12, 2023 · Updated 3 years ago