ARiSE-Lab / CYCLE_OOPSLA_24Links
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10Updated last year
Alternatives and similar repositories for CYCLE_OOPSLA_24
Users that are interested in CYCLE_OOPSLA_24 are comparing it to the libraries listed below
Sorting:
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆68Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆62Updated last year
- Training and Benchmarking LLMs for Code Preference.☆36Updated 10 months ago
- ☆53Updated last year
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆25Updated 4 months ago
- ☆28Updated 11 months ago
- ☆35Updated 2 years ago
- ☆28Updated 3 weeks ago
- Reinforcement Learning for Repository-Level Code Completion☆40Updated last year
- Codebase for Instruction Following without Instruction Tuning☆35Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆72Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆67Updated 11 months ago
- ☆39Updated 3 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆12Updated last year
- ☆22Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆56Updated last week
- ☆17Updated last month
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆22Updated 11 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- ☆71Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆62Updated 11 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆21Updated 10 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆88Updated 5 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆28Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆79Updated last year
- ☆33Updated 3 months ago
- ☆20Updated 5 months ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year