RUCAIBox/RLMEC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RUCAIBox/RLMEC)

RUCAIBox / RLMEC

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

☆39

Alternatives and similar repositories for RLMEC

Users that are interested in RLMEC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RUCAIBox / ChatCoT
View on GitHub
The official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models"
☆47Jun 2, 2023Updated 3 years ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
swtheing / PF-PPO-RLHF
View on GitHub
☆34Sep 14, 2024Updated last year
wutaiqiang / LLM_KD_AKL
View on GitHub
☆22Oct 22, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CodeGuardPlus / CodeGuardPlus
View on GitHub
CodeGuard+: Constrained Decoding for Secure Code Generation
☆22Jul 30, 2024Updated last year
kyegomez / SelfExtend
View on GitHub
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Nov 11, 2024Updated last year
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆35Aug 15, 2024Updated last year
BinWang28 / FacEval
View on GitHub
EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization
☆13Mar 20, 2025Updated last year
Shentao-YANG / Preference_Grounded_Guidance
View on GitHub
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
XueyangFeng / ReHAC
View on GitHub
Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"
☆34Sep 20, 2024Updated last year
yaof20 / ReaL
View on GitHub
Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"
☆42Jul 21, 2025Updated last year
zlxxlz1026 / CSHI
View on GitHub
☆14Jun 18, 2024Updated 2 years ago
nasosger / MuToR
View on GitHub
[NeurIPS '25] Multi-Token Prediction Needs Registers
☆30Dec 14, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Cohere-Labs-Community / goodtriever
View on GitHub
Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"
☆25May 30, 2024Updated 2 years ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
AxelSorensenDev / Eevee
View on GitHub
An Easy Annotation Tool for Natural Language Processing
☆12May 17, 2024Updated 2 years ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
sauc-abadal / ALT
View on GitHub
Official repository for ALT (ALignment with Textual feedback).
☆10Jul 25, 2024Updated last year
IBM / SALMON
View on GitHub
Self-Alignment with Principle-Following Reward Models
☆170Sep 18, 2025Updated 10 months ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
ServiceNow / promptmix-emnlp-2023
View on GitHub
Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023
☆12Dec 13, 2023Updated 2 years ago
zhongwanjun / ProQA
View on GitHub
The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"
☆11Feb 7, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shunzh / mcts-for-llm
View on GitHub
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆16Jun 28, 2024Updated 2 years ago
waterhorse1 / Natural-language-RL
View on GitHub
Natural Language Reinforcement Learning
☆101Jul 30, 2025Updated 11 months ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
GAIR-NLP / ReAlign
View on GitHub
Reformatted Alignment
☆111Sep 23, 2024Updated last year
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
kobayashikanna01 / Chain-of-Discussion
View on GitHub
☆11May 28, 2024Updated 2 years ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27Oct 3, 2025Updated 9 months ago
liutianlin0121 / decoding-time-realignment
View on GitHub
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
feiyang-k / AutoScale
View on GitHub
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14Aug 8, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zankner / CLoud
View on GitHub
Critique-out-Loud Reward Models
☆76Oct 18, 2024Updated last year
weizhepei / ReadingList
View on GitHub
A list of research resources that I've appreciated.
☆12Dec 10, 2019Updated 6 years ago
chosendai / ChineseBLUE
View on GitHub
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
☆13Dec 23, 2019Updated 6 years ago
shenao-zhang / reward-augmented-preference
View on GitHub
The official implementation of Preference Data Reward-Augmentation.
☆18May 1, 2025Updated last year
Vance0124 / Token-level-Direct-Preference-Optimization
View on GitHub
Reference implementation for Token-level Direct Preference Optimization(TDPO)
☆156Feb 14, 2025Updated last year
yixinL7 / SumLLM
View on GitHub
Repo for "On Learning to Summarize with Large Language Models as References"
☆44May 24, 2023Updated 3 years ago
yxuansu / Contrastive_Search_versus_Contrastive_Decoding
View on GitHub
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
☆27Jun 7, 2024Updated 2 years ago