holarissun/Prompt-OIRL


holarissun / Prompt-OIRL

Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"

☆45

Alternatives and similar repositories for Prompt-OIRL

Users that are interested in Prompt-OIRL are comparing it to the libraries listed below.


lqtrung1998 / mwp_cot_design
☆14 · Oct 11, 2023 · Updated 2 years ago
AIforIP / FlowPIE
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration
☆19 · Apr 26, 2026 · Updated 3 months ago
nirgreshler / bayesian-online-planning
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13 · Jun 17, 2024 · Updated 2 years ago
ritzz-ai / PACS
☆31 · Sep 12, 2025 · Updated 10 months ago
MJ-Jang / BECEL
☆10 · Jan 28, 2024 · Updated 2 years ago
SihengLi99 / LLM-Honesty-Survey
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66 · Dec 8, 2024 · Updated last year
RUC-NLPIR / fullrank
☆39 · Apr 6, 2026 · Updated 3 months ago
lemon-prog123 / LongRePS
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
☆19 · Apr 1, 2025 · Updated last year
anniesch / single-life-rl
Single-Life Reinforcement Learning
☆14 · Dec 17, 2022 · Updated 3 years ago
morning9393 / ETPO
☆14 · Mar 5, 2024 · Updated 2 years ago
x35f / unstable_baselines
Re-implementations of SOTA RL algorithms.
☆137 · Sep 7, 2023 · Updated 2 years ago
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 · Feb 23, 2024 · Updated 2 years ago
dengyang17 / PACIFIC
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
☆14 · May 15, 2024 · Updated 2 years ago
init0xyz / AdaCQR
Implementation of AdaCQR (COLING 2025)
☆15 · Dec 30, 2024 · Updated last year
lamda-bbo / mcts-transfer
Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".
☆13 · Nov 28, 2024 · Updated last year
needylove / PH-Reg
The code of "Deep Regression Representation Learning with Topology" in ICML 2024
☆14 · Jul 4, 2024 · Updated 2 years ago
Hambaobao / SWE-Flow
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
☆40 · Jun 29, 2025 · Updated last year
FanmingL / SmartLogger
☆12 · May 14, 2024 · Updated 2 years ago
UnHans / Awesome-Data-Centric-Graph-Learning-Papers
An awesome, curated list of advanced graph data-centric learning papers (i.e., graph sparsification, graph denoising, graph condensation)
☆17 · Jun 9, 2025 · Updated last year
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
styfeng / GenAug
Code for GenAug: Data Augmentation for Finetuning Text Generators.
☆28 · Oct 8, 2021 · Updated 4 years ago
jpheffne / epe
Data and code for Emotion Prediction Errors
☆11 · Feb 22, 2022 · Updated 4 years ago
lamps-lab / Patent-figure-segmentor
☆14 · Aug 12, 2022 · Updated 3 years ago
RainBowLuoCS / DEEM
(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆51 · Jul 1, 2025 · Updated last year
UCSB-NLP-Chang / Prereq_tune
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆11 · Jan 10, 2025 · Updated last year
mmrezaee / VRTM
"A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)
☆11 · Apr 26, 2021 · Updated 5 years ago
jxzhangjhu / Awesome-OOD-detection
SOTA work about out-of-distribution detection
☆14 · Mar 5, 2021 · Updated 5 years ago
holarissun / RewardModelingBeyondBradleyTerry
Official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆73 · Apr 2, 2025 · Updated last year
CharlieMat / GFN4Rec
Source code for paper "Generative Flow Network for Listwise Recommendation"
☆18 · Nov 8, 2024 · Updated last year
yifeiwang77 / Self-Correction
☆20 · Nov 3, 2024 · Updated last year
analyzer2004 / weathersankey
A beautiful weather visualization JavaScript library ☀🌤☁🌧🌨
☆17 · Apr 26, 2021 · Updated 5 years ago
upiterbarg / diff_history
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20 · Aug 20, 2024 · Updated last year
flageval-baai / HalluDial
☆21 · Aug 19, 2024 · Updated last year
UCSB-NLP-Chang / llm_uncertainty
☆43 · Feb 2, 2024 · Updated 2 years ago
junming-yang / mopo
Model-based Offline Policy Optimization, re-implemented entirely in PyTorch
☆43 · Sep 13, 2023 · Updated 2 years ago
lmatosevic / chatgpt-cli
CLI tool for interacting with ChatGPT from the terminal
☆12 · Jan 28, 2026 · Updated 6 months ago
jiangshdd / ReviewCritique
☆13 · Sep 26, 2024 · Updated last year
ht014 / SG2HOI
☆12 · Sep 19, 2021 · Updated 4 years ago
jdchang1 / milo
☆16 · Oct 5, 2021 · Updated 4 years ago