code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning
β45Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for Prompt-OIRL
Users that are interested in Prompt-OIRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ACL 2025) π₯π₯π₯Code for "Empowering Multimodal Large Language Models with Evol-Instruct"β22May 15, 2025Updated last year
- Meta RL codebase for Unstable Baselinesβ22Dec 6, 2022Updated 3 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.β13Jun 17, 2024Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ66Dec 8, 2024Updated last year
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiβ¦β42Nov 9, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ACL 2026] Repository of IPBenchβ23Apr 6, 2026Updated 2 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Modelsβ41Sep 30, 2024Updated last year
- β40Apr 6, 2026Updated 2 months ago
- LongAttn οΌSelecting Long-context Training Data via Token-level Attentionβ15Jul 16, 2025Updated 11 months ago
- β34Jan 15, 2026Updated 5 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- β14Mar 5, 2024Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"β11Jan 10, 2025Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.β10May 16, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Re-implementations of SOTA RL algorithms.β137Sep 7, 2023Updated 2 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"β14Mar 25, 2025Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.β11Apr 5, 2023Updated 3 years ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissionsβ14Nov 23, 2025Updated 6 months ago
- β13Aug 12, 2022Updated 3 years ago
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".β13Nov 28, 2024Updated last year
- Implementation of AdaCQR(COLING 2025)β15Dec 30, 2024Updated last year
- β12May 14, 2024Updated 2 years ago
- The code of "Deep Regression Representation Learning with Topology" in ICML 2024β14Jul 4, 2024Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- β21Dec 5, 2022Updated 3 years ago
- SOTA work about out-of-distribution detectionβ14Mar 5, 2021Updated 5 years ago
- "A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)β11Apr 26, 2021Updated 5 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multilingβ¦β12Jan 25, 2019Updated 7 years ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, andβ¦β72Apr 2, 2025Updated last year
- Source code for paper "Generative Flow Network for Listwise Recommendation"β17Nov 8, 2024Updated last year
- β21Aug 19, 2024Updated last year
- β20Nov 3, 2024Updated last year
- β13Sep 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenesβ11Nov 21, 2019Updated 6 years ago
- source code for AAMAS 2023 Imperfect-information Card Game Competitionβ13Mar 21, 2024Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)β20Aug 20, 2024Updated last year
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insightsβ32Jan 9, 2026Updated 5 months ago
- CLI tool for interacting with ChatGPT using terminalβ12Jan 28, 2026Updated 4 months ago
- β18Apr 11, 2024Updated 2 years ago
- β43Feb 2, 2024Updated 2 years ago