sanowl/OmegaPRM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sanowl/OmegaPRM)

sanowl / OmegaPRM

this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google deepmind

☆50

Alternatives and similar repositories for OmegaPRM

Users that are interested in OmegaPRM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
CRIPAC-DIG / tgm-dlm
View on GitHub
Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model
☆33Jun 24, 2025Updated last year
jins7 / LatentEvolve
View on GitHub
☆27Oct 9, 2025Updated 9 months ago
CSSLab / ThinkTwice
View on GitHub
Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
☆15Apr 22, 2026Updated 3 months ago
deepglint / Victor
View on GitHub
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
☆29Aug 15, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NJUNLP / AdaR
View on GitHub
☆15Dec 8, 2025Updated 7 months ago
lqtrung1998 / mwp_cot_design
View on GitHub
☆14Oct 11, 2023Updated 2 years ago
dali-does / clevr-math
View on GitHub
☆13May 9, 2023Updated 3 years ago
MAGAer13 / DeCapBench
View on GitHub
Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)
☆14Mar 6, 2025Updated last year
SALT-NLP / PersuationGames
View on GitHub
[ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…
☆16Feb 22, 2025Updated last year
JingMog / THOR
View on GitHub
[ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆33Feb 26, 2026Updated 5 months ago
ahnobari / ActivationInformedMerging
View on GitHub
Official repository for Activation-Informed Merging (AIM) of Large Language Models
☆24Feb 10, 2025Updated last year
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
FreedomIntelligence / OVM
View on GitHub
☆74Apr 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
google-deepmind / streamingqa
View on GitHub
☆51Oct 10, 2023Updated 2 years ago
daje0601 / Google_SCoRe
View on GitHub
Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)
☆141Sep 21, 2024Updated last year
Deno-V / tgm-dlm
View on GitHub
Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model
☆21Jun 24, 2025Updated last year
YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 5 months ago
SparkJiao / dpo-trajectory-reasoning
View on GitHub
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆84Jan 14, 2025Updated last year
open-compass / CIBench
View on GitHub
Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "
☆15Jul 19, 2024Updated 2 years ago
tdietert / lambda-pi
View on GitHub
A toy implementation of the dependently typed lambda calculus known as λΠ
☆12Jan 29, 2020Updated 6 years ago
purbeshmitra / MOTIF
View on GitHub
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
☆17Jul 6, 2025Updated last year
xiusic / MinPrompt
View on GitHub
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
☆14May 3, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
TencentARC / Plot2Code
View on GitHub
☆23Aug 17, 2024Updated last year
Jiaxin-Pei / Prompting-with-Social-Roles
View on GitHub
☆50Oct 14, 2024Updated last year
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
Hui-design / R1-Video-fixbug
View on GitHub
[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆23Feb 23, 2025Updated last year
PeterDWhite / Osker
View on GitHub
Sharing work on resumption monad
☆12Sep 18, 2012Updated 13 years ago
ku-fpg / hood
View on GitHub
Hood debugger, based on the idea of observing functions and structures as they are evaluated.
☆20Jun 3, 2018Updated 8 years ago
weitongseu / PCL
View on GitHub
☆10Jul 11, 2022Updated 4 years ago
EricTan7 / TGP-T
View on GitHub
[AAAI2024] Official implementation of TGP-T
☆31Apr 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
THU-CVML / TextureDiffusion
View on GitHub
[ICASSP 2025 Oral] The official implementation of paper "TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfe…
☆18Mar 13, 2025Updated last year
rgrishman / ice
View on GitHub
Ice is a rapid information extraction customizer
☆15Apr 26, 2021Updated 5 years ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
aminglinux / centos7
View on GitHub
☆11Apr 29, 2020Updated 6 years ago
pm25 / Semi-Supervised-Regression
View on GitHub
[NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'
☆14Aug 22, 2025Updated 11 months ago
yanqiangmiffy / tree2retriever
View on GitHub
Recursive Abstractive Processing for Tree-Organized Retrieval
☆10May 30, 2024Updated 2 years ago