Yuancheng-Xu/GenARM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yuancheng-Xu/GenARM)

Yuancheng-Xu / GenARM

Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"

☆24

Alternatives and similar repositories for GenARM

Users that are interested in GenARM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

INK-USC / hypter
View on GitHub
Zero-shot Learning by Generating Task-specific Adapters
☆14Apr 2, 2021Updated 5 years ago
deeplearning-wisc / args
View on GitHub
☆47Feb 8, 2024Updated 2 years ago
SteveKGYang / MetaAligner
View on GitHub
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24Sep 26, 2024Updated last year
ZHZisZZ / weak-to-strong-search
View on GitHub
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
☆67Dec 10, 2024Updated last year
dong-river / Personalized-Judge
View on GitHub
☆10Jun 15, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
F2-Song / Weak-to-Strong-Decoding
View on GitHub
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆22Jun 26, 2025Updated last year
zjunlp / ModelKinship
View on GitHub
Exploring Model Kinship for Merging Large Language Models
☆28Apr 16, 2025Updated last year
umd-huang-lab / WAVES
View on GitHub
Code for our paper "Benchmarking the Robustness of Image Watermarks"
☆105Sep 15, 2024Updated last year
emory-irlab / DUQGen
View on GitHub
Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation
☆16Apr 23, 2025Updated last year
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
rotaryhammer / code-autodan
View on GitHub
An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)
☆46Feb 8, 2024Updated 2 years ago
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
tlc4418 / llm_optimization
View on GitHub
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
☆49Jan 16, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Ken-Chy129 / student-course-choosing
View on GitHub
基于 Spring Boot + Redis + RabbitMQ 的高并发学生选课系统，支持选退课、课程管理、实时消息通知
☆11Mar 31, 2026Updated 3 months ago
okarthikb / DPO
View on GitHub
Implementation of Direct Preference Optimization
☆17Jul 17, 2023Updated 3 years ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
Zhang-Yihao / Adversarial-Representation-Engineering
View on GitHub
Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
☆20Dec 6, 2024Updated last year
peterljq / Parsimonious-Concept-Engineering
View on GitHub
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆43Jan 18, 2026Updated 6 months ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 6 months ago
haoweiz23 / DistDiff
View on GitHub
[NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".
☆17Dec 15, 2025Updated 7 months ago
zwhong714 / weak-to-strong-preference-optimization
View on GitHub
[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
☆18Feb 24, 2025Updated last year
ydc123 / MMP-Attack
View on GitHub
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆17Jul 15, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Vekteur / probabilistic-calibration-study
View on GitHub
Implementation of "A Large-Scale Study of Probabilistic Calibration in Neural Network Regression" (ICML 2023)
☆11Oct 7, 2025Updated 9 months ago
ParrotClever / DL_point
View on GitHub
☆12Jan 9, 2025Updated last year
tmlr-group / AR-Bench
View on GitHub
[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"
☆47Oct 8, 2025Updated 9 months ago
1229095296 / ResRL
View on GitHub
This repository includes code for our paper: ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning…
☆15May 2, 2026Updated 2 months ago
dmhyun / MSRP
View on GitHub
Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 …
☆10May 20, 2023Updated 3 years ago
weiiguo / Wireless-Agent
View on GitHub
☆13May 6, 2025Updated last year
brightjade / PRiSM
View on GitHub
Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …
☆11Jun 20, 2025Updated last year
ASTRAL-Group / ASTRA
View on GitHub
[CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbre…
☆62Jul 5, 2025Updated last year
QizhouWang / MAIL
View on GitHub
source code for NeurIPS21 paper robabilistic Margins for Instance Reweighting in Adversarial Training
☆11Apr 28, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Asap7772 / fewshot-preference-optimization
View on GitHub
Few-Shot Preference Optimization (FSPO) personalizes LLMs by reframing reward modeling as a meta-learning problem, enabling rapid adaptat…
☆16Feb 27, 2025Updated last year
Shark-NLP / self-adaptive-ICL
View on GitHub
self-adaptive in-context learning
☆45May 5, 2023Updated 3 years ago
zaixizhang / FoldMark
View on GitHub
Implementation of FoldMark: Safeguarding Protein Structure Generative Models with Distributional and Evolutionary Watermarking
☆23Apr 19, 2026Updated 3 months ago
IandRover / MAML_noisy_contrasive_learner
View on GitHub
☆17Oct 7, 2022Updated 3 years ago
zyxxmu / DSnoT
View on GitHub
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆51Apr 9, 2024Updated 2 years ago
nbelle1 / strategy-game-agents
View on GitHub
☆16Oct 23, 2025Updated 9 months ago
KaiyuanZh / CENSOR
View on GitHub
[NDSS 2025] CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
☆19Jan 18, 2025Updated last year