sdiehl/prm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sdiehl/prm)

sdiehl / prm

Library for training process reward models

☆29

Alternatives and similar repositories for prm

Users that are interested in prm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

benlipkin / decoding
View on GitHub
Composable inference algorithms with LLMs and programmable logic
☆69Dec 4, 2024Updated last year
OpenMatch / Gist-COCO
View on GitHub
This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".
☆13Feb 27, 2024Updated 2 years ago
NEUIR / P-ALIGN
View on GitHub
[ACL '26] source code for the paper: "Long-Chain Reasoning Distillation via Adaptive Prefix Alignment"
☆16Jan 21, 2026Updated 6 months ago
OpenBMB / ConsJudge
View on GitHub
☆18Mar 23, 2025Updated last year
OpenBMB / RAG-DDR
View on GitHub
This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".
☆23Oct 28, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenBMB / DEBATER
View on GitHub
This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…
☆26Mar 2, 2025Updated last year
NEUIR / LISRec
View on GitHub
[KDD '26] This is the code repo for our KDD '26 paper "LISRec: Modeling User Preferences with Learned Item Shortcuts for Sequential Recom…
☆18Jul 20, 2026Updated last week
NEUIR / LegalDuet
View on GitHub
[ADMA 2025 Best Paper Award] Code repo for our ADMA'25 paper: LegalDuet: Learning Fine-grained Representations for Legal Judgment Predict…
☆17Feb 11, 2026Updated 5 months ago
sdiehl / tiny-r1
View on GitHub
Recreating the minimal training methods of DeepSeek-R1 for small langauge models.
☆22Feb 10, 2025Updated last year
NEUIR / Lang2Act
View on GitHub
[ACL '26] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains
☆25Apr 7, 2026Updated 3 months ago
JmlrOrg / jmlr-coverletter
View on GitHub
JMLR Cover Letter Template
☆10Dec 15, 2021Updated 4 years ago
smthemex / ComfyUI_CustomNet
View on GitHub
A CustomNet node for ComfyUI
☆10Aug 11, 2024Updated last year
NEUIR / LegalDelta
View on GitHub
[ICASSP '26] This is the code repo for our paper: LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thou…
☆31Jul 1, 2026Updated 3 weeks ago
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenBMB / MetaMem
View on GitHub
[ACL '26] This is the code repo for our ACL '26 Findings paper "MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Refl…
☆40Jul 2, 2026Updated 3 weeks ago
Maras13 / git_playground
View on GitHub
A hands-on repository for learning GitHub basics! Dive into beginner-friendly exercises that guide you through creating repositories, mak…
☆15Dec 3, 2024Updated last year
mukhal / ThinkPRM
View on GitHub
[TMLR] Process Reward Models That Think
☆90Nov 29, 2025Updated 8 months ago
yale-nlp / QTSumm
View on GitHub
Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"
☆23Mar 29, 2024Updated 2 years ago
deep-spin / non-exchangeable-crc
View on GitHub
☆11Sep 25, 2025Updated 10 months ago
NEUIR / Uncode
View on GitHub
[ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"
☆45Jun 26, 2026Updated last month
km1994 / nlp_paper_study_search_engine
View on GitHub
该仓库主要记录 NLP 算法工程师相关的搜索引擎学习笔记
☆14Apr 9, 2022Updated 4 years ago
viking-sudo-rm / rusty-dawg
View on GitHub
Rust library for indexing and quickly searching large pretraining corpora
☆31Oct 30, 2025Updated 8 months ago
lucassilveira96 / silveirinha
View on GitHub
☆11Nov 23, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tianyi-lab / Moltbook_Socialization
View on GitHub
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
☆18Feb 17, 2026Updated 5 months ago
ssmisya / PRMBench
View on GitHub
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆94Feb 15, 2025Updated last year
fywalter / label-bias
View on GitHub
A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning
☆10Aug 4, 2023Updated 2 years ago
ml-stat-Sustech / conformal_prediction_via_label_ranking
View on GitHub
[ICML'24] Conformal Prediction for Deep Classifier via Label Ranking
☆14Jun 14, 2024Updated 2 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
SafeRoboticsLab / Who_Plays_First
View on GitHub
Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots" - RSS 2024
☆18Jun 25, 2024Updated 2 years ago
2003pro / TAGCOS
View on GitHub
This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆13Jul 21, 2024Updated 2 years ago
lingchen0331 / UQ_ICL
View on GitHub
Uncertainty quantification for in-context learning of large language models
☆15Apr 1, 2024Updated 2 years ago
AshOlogn / Paragraph-level-Simplification-of-Medical-Texts
View on GitHub
☆24Jan 17, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenMatch / RAG-DDR
View on GitHub
[ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…
☆55Feb 10, 2025Updated last year
NEUIR / RankCoT
View on GitHub
[ACL '25] Source code for our paper ''RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts''
☆53Nov 27, 2025Updated 8 months ago
shihux / sa_transformer
View on GitHub
Code for "Transformer-Based Deep Survival Analysis"
☆13May 27, 2022Updated 4 years ago
VanekPetr / flan-t5-text-classifier
View on GitHub
Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…
☆44Oct 28, 2024Updated last year
intuit-ai-research / SPUQ
View on GitHub
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
☆17Jun 24, 2024Updated 2 years ago
princetonvisualai / icons
View on GitHub
☆22Apr 24, 2025Updated last year
WindyLee0822 / Process_Q_Model
View on GitHub
official implementation of paper "Process Reward Model with Q-value Rankings"
☆69Feb 5, 2025Updated last year