RUCKBReasoning/CodeRM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RUCKBReasoning/CodeRM)

RUCKBReasoning / CodeRM

Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'

☆27

Alternatives and similar repositories for CodeRM

Users that are interested in CodeRM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

satori-reasoning / Satori-SWE
View on GitHub
☆21May 30, 2025Updated last year
RUCKBReasoning / CoT-based-Synthesizer
View on GitHub
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆32May 19, 2025Updated last year
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆18Apr 7, 2026Updated 3 months ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
mlfoundations / tabliblib
View on GitHub
A Python library for processing and filtering TabLib
☆14Aug 24, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
IBM / ColPret
View on GitHub
Efficient Scaling laws and collaborative pretraining.
☆22Sep 18, 2025Updated 10 months ago
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
zmzhang2000 / trustworthy-alignment
View on GitHub
Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
☆12Sep 2, 2024Updated last year
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
Qichuzyy / POA
View on GitHub
Official implementation of ECCV24 paper: POA
☆24Aug 8, 2024Updated last year
hangeol / UniR
View on GitHub
Official repo for paper: Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
☆20Nov 26, 2025Updated 7 months ago
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
chujiezheng / LLM-Extrapolation
View on GitHub
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75May 20, 2025Updated last year
iSEngLab / LLM4UT_Empirical
View on GitHub
[ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing
☆13Feb 9, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
UKPLab / incorporating-relevance
View on GitHub
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14Mar 30, 2026Updated 3 months ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 4 months ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 5 months ago
yongchao98 / R1-Code-Interpreter
View on GitHub
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
☆44Feb 9, 2026Updated 5 months ago
tianyi-lab / DisCL
View on GitHub
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18Sep 26, 2025Updated 9 months ago
shaharl6000 / MoreDocsSameLen
View on GitHub
This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…
☆18Mar 13, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
EvanZhuang / AgenticLU
View on GitHub
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆13Sep 22, 2025Updated 9 months ago
riccizz / HRF
View on GitHub
☆18May 13, 2025Updated last year
Anonymous1252022 / Megatron-DeepSpeed
View on GitHub
☆18Sep 22, 2024Updated last year
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
mxzheng / TrojViT
View on GitHub
[CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang
☆15Jan 5, 2024Updated 2 years ago
tanganke / subspace_fusion
View on GitHub
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14Mar 28, 2024Updated 2 years ago
rezashkv / diffusion_pruning
View on GitHub
[ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.
☆15Feb 1, 2025Updated last year
SongW-SW / CEB
View on GitHub
☆15Jun 25, 2025Updated last year
mtharrison / promptscaper
View on GitHub
A client-only OpenAI LLM Playground for prototyping agents without writing any code.
☆22Aug 31, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tanganke / pareto_set_learning
View on GitHub
Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"
☆11Sep 13, 2024Updated last year
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
tongxuluo / LeaP
View on GitHub
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26May 13, 2025Updated last year
Jiahao004 / DeepTheorem
View on GitHub
☆26Jun 10, 2025Updated last year
MLE-Dojo / MLE-Dojo
View on GitHub
☆99Oct 30, 2025Updated 8 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
thunlp / Optima
View on GitHub
Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"
☆72Nov 14, 2024Updated last year