GXimingLu/IPA

Codebase for Inference-Time Policy Adapters

☆25

Alternatives and similar repositories for IPA

Users who are interested in IPA are comparing it to the repositories listed below.

chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
minbeomkim / CriticControl
Official github repository of CriticControl
☆29 · Aug 6, 2023 · Updated 2 years ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45 · Oct 1, 2025 · Updated 9 months ago
stellalisy / alfa
Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
☆18 · Feb 21, 2025 · Updated last year
kschweig / OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆26 · Jan 16, 2023 · Updated 3 years ago
allenai / hybrid-preferences
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29 · Jul 23, 2025 · Updated last year
naver-ai / ALMoST
☆24 · Dec 2, 2023 · Updated 2 years ago
GXimingLu / Quark
☆75 · Nov 3, 2023 · Updated 2 years ago
illidanlab / ABD
[ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers
☆24 · Jul 7, 2024 · Updated 2 years ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 · Feb 29, 2024 · Updated 2 years ago
jaehunjung1 / cascaded-selective-evaluation
☆29 · Feb 24, 2025 · Updated last year
Sachin19 / mucoco
Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Const…
☆67 · Mar 21, 2024 · Updated 2 years ago
epfl-dlab / invariant-language-models
A framework to train language models to learn invariant representations.
☆14 · Jan 24, 2022 · Updated 4 years ago
wskbest / MFC-Bench
☆12 · Oct 17, 2024 · Updated last year
launchnlp / BOLT
Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".
☆22 · Sep 7, 2023 · Updated 2 years ago
pharaouk / dharma
☆13 · Apr 25, 2024 · Updated 2 years ago
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆33 · Mar 28, 2024 · Updated 2 years ago
yangkevin2 / naacl-2021-fudge-controlled-generation
☆102 · Aug 24, 2022 · Updated 3 years ago
allenai / MacGyver
Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?
☆30 · Mar 26, 2024 · Updated 2 years ago
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆84 · Jan 14, 2025 · Updated last year
jbshp / LongDocFACTScore
☆10 · May 28, 2024 · Updated 2 years ago
alisawuffles / DExperts
code associated with ACL 2021 DExperts paper
☆119 · May 24, 2023 · Updated 3 years ago
morning9393 / ETPO
☆14 · Mar 5, 2024 · Updated 2 years ago
Joachm / neural_diversity_nets
optimize neuro-centric parameters instead of weights to solve RL tasks
☆14 · Oct 2, 2023 · Updated 2 years ago
raunak96 / google-docs
A google doc clone using NextJs, TailwindCss, DraftJs based Rich Text Editor with styles even applied for printing the doc.
☆10 · Jul 20, 2021 · Updated 5 years ago
huiwy / reflection-on-trees
☆14 · May 9, 2024 · Updated 2 years ago
HCY123902 / atg-w-fg-rw
☆10 · May 27, 2024 · Updated 2 years ago
HarlynDN / WebCiteS
[ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
☆13 · Sep 11, 2024 · Updated last year
PKU-Alignment / aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
☆194 · Jan 16, 2025 · Updated last year
keanekwa / MathUwU
Gamified training platform for quantitative finance interviews. Full stack application developed with JavaScript (TypeScript & Next.js) a…
☆12 · Dec 19, 2024 · Updated last year
azinmatin / prince
☆11 · Mar 25, 2022 · Updated 4 years ago
shizhediao / Black-Box-Prompt-Learning
Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"
☆59 · Sep 7, 2023 · Updated 2 years ago
mireshghallah / neighborhood-curvature-mia
☆27 · Aug 18, 2023 · Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
Miaoranmmm / SelfChecker
codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"
☆12 · Feb 10, 2025 · Updated last year
facebookresearch / motif
Intrinsic Motivation from Artificial Intelligence Feedback
☆136 · Nov 7, 2023 · Updated 2 years ago
SalesforceAIResearch / indict_code_gen
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
☆15 · Jun 2, 2026 · Updated last month
mariodev12 / react-native-styling-cheat-sheet
Most of the React Native styling material in one page
☆14 · Aug 19, 2016 · Updated 9 years ago
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆134 · Mar 30, 2024 · Updated 2 years ago