yaof20 / ReaLLinks
Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"
☆20Updated last month
Alternatives and similar repositories for ReaL
Users that are interested in ReaL are comparing it to the libraries listed below
Sorting:
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆67Updated 10 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 8 months ago
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated last year
- ☆28Updated last week
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- ☆27Updated 6 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆33Updated 4 months ago
- ☆55Updated 3 weeks ago
- ☆20Updated last year
- Using FlexAttention to compute attention with different masking patterns☆44Updated 9 months ago
- ☆34Updated 3 weeks ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Updated 11 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆61Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 6 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- ☆23Updated 3 months ago
- ☆41Updated last year
- ☆49Updated last year
- ☆36Updated 2 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆29Updated last year
- Kinetics: Rethinking Test-Time Scaling Laws☆65Updated last week
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆10Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Updated 5 months ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- The repository contains code for Adaptive Data Optimization☆25Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- ☆27Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated 2 months ago
- ☆27Updated 5 months ago
- Concise Reasoning via Reinforcement Learning☆13Updated 3 months ago