Freder-chen/ReasonGenRM

A simple implementation of ReasonGenRM.

☆19

Alternatives and similar repositories for ReasonGenRM

Users who are interested in ReasonGenRM are comparing it to the libraries listed below.

HarlynDN / WebCiteS
[ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
☆13 · Sep 11, 2024 · Updated last year
liziniu / cold_start_rl
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20 · Mar 9, 2025 · Updated last year
dayu11 / Availability-Attacks-Create-Shortcuts
☆10 · Jul 28, 2022 · Updated 3 years ago
tpoisonooo / open-r1
Fully open reproduction of DeepSeek-R1
☆11 · Mar 24, 2025 · Updated last year
TomSheng21 / AdaptGuard
ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation
☆11 · Dec 23, 2023 · Updated 2 years ago
junkangwu / alpha-DPO
[ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"
☆31 · Jan 10, 2026 · Updated 6 months ago
Hanpx20 / SafeSwitch
Official code repository for the paper "Internal Activation as the Polar Star for Steering Unsafe LLM Behavior"
☆15 · May 31, 2026 · Updated last month
dongjinhao-ruc / MarginGAN
This repository is the replication package of the NeurIPS19 paper "MarginGAN: Adversarial Training in Semi-Supervised Learning"
☆12 · Oct 27, 2019 · Updated 6 years ago
zhliu0106 / learning-to-refuse
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10 · Dec 13, 2024 · Updated last year
WisdomShell / RewardAnything
RewardAnything: Generalizable Principle-Following Reward Models
☆44 · Jun 11, 2025 · Updated last year
thu-coai / LongSafety
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16 · Jun 18, 2025 · Updated last year
namkoong-lab / PersonalLLM
☆18 · Oct 8, 2024 · Updated last year
hcoxec / soft_h
soft entropy estimation
☆16 · May 29, 2026 · Updated last month
sebbyjp / ros2_transformers
Robotics transformers inference servers in ROS2. RT-1, RT-X, Octo.
☆17 · Oct 14, 2024 · Updated last year
sooonwoo / CL-Baselines
This is a PyTorch implementation of contrastive learning (CL) baselines.
☆14 · Aug 29, 2022 · Updated 3 years ago
alvin-yang68 / Marching-Cubes
Implementation of the Marching Cubes algorithm in Python.
☆11 · Dec 10, 2020 · Updated 5 years ago
liujch1998 / ppo-mcts
☆21 · Nov 13, 2023 · Updated 2 years ago
mwoedlinger / ecsic
Official code of our WACV paper "ECSIC: Epipolar Cross Attention for Stereo Image Compression"
☆15 · Dec 27, 2023 · Updated 2 years ago
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆68 · Oct 16, 2024 · Updated last year
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 · Oct 29, 2024 · Updated last year
guanjiyang / SAC
☆18 · Oct 7, 2022 · Updated 3 years ago
prnake / kimi-deepresearch
Kimi K2 Thinking Agentic Search Unofficial Implementation
☆15 · Nov 9, 2025 · Updated 8 months ago
Cognition2Action-Lab / VLA-TMEE
Reshaping Action Error Distributions for Reliable Vision-Language-Action Models
☆17 · Feb 5, 2026 · Updated 5 months ago
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆54 · Jun 6, 2025 · Updated last year
bpwu1 / confidence-regulation-neurons
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆15 · Feb 1, 2025 · Updated last year
RUC-GSAI / Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve …
☆40 · May 31, 2025 · Updated last year
27182812 / ChineseBERT_paddle
A Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021)
☆10 · Nov 15, 2021 · Updated 4 years ago
lavinal712 / control-lora-v3
☆11 · Dec 15, 2025 · Updated 7 months ago
somuchtome / SimAC
[CVPR 2024] official code for SimAC
☆21 · Jan 23, 2025 · Updated last year
WangYipu2002 / CrossPoint
Official implementation of “Towards Cross-View Point Correspondence in Vision-Language Models”.
☆15 · Dec 24, 2025 · Updated 6 months ago
zeaver / MultiFactor
Implementation of EMNLP 2023 Findings: Improving Question Generation with Multi-level Content Planning
☆17 · Nov 30, 2023 · Updated 2 years ago
DzvinkaYarish / ControlNet-different-backbones
☆12 · Jun 15, 2023 · Updated 3 years ago
Alibaba-AAIG / ClawArmor
Self-Evolving Defense for AI Agents: Protect against prompt injection, data exfiltration, and multi-stage attacks with adaptive security…
☆19 · Apr 8, 2026 · Updated 3 months ago
NuoJohnChen / JudgeLRM
JudgeLRM: Large Reasoning Models as a Judge
☆42 · May 6, 2026 · Updated 2 months ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆59 · May 28, 2025 · Updated last year
Jordan-HS / Diversity_is_Definitely_Needed
[CVPRW 2023] Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion
☆24 · Jan 24, 2024 · Updated 2 years ago
dunzeng / MORE
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16 · Aug 6, 2024 · Updated last year
DoYangTan / verl-rubric
☆29 · Jan 31, 2026 · Updated 5 months ago
Evanwu1125 / AutoWebWorld
☆25 · Jul 10, 2026 · Updated last week