Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
★44 · Mar 20, 2024 · Updated 2 years ago
Alternatives and similar repositories for Prompt-OIRL
Users who are interested in Prompt-OIRL are comparing it to the libraries listed below.
- ★14 · Oct 11, 2023 · Updated 2 years ago
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ★22 · May 15, 2025 · Updated 11 months ago
- Meta RL codebase for Unstable Baselines ★22 · Dec 6, 2022 · Updated 3 years ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ★19 · Apr 1, 2025 · Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ★13 · Jun 17, 2024 · Updated last year
- ★10 · Jan 28, 2024 · Updated 2 years ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models ★65 · Dec 8, 2024 · Updated last year
- This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi… ★40 · Nov 9, 2025 · Updated 5 months ago
- [ACL 2026] Repository of IPBench ★21 · Apr 6, 2026 · Updated last month
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ★41 · Sep 30, 2024 · Updated last year
- ★15 · Nov 19, 2021 · Updated 4 years ago
- ★40 · Apr 6, 2026 · Updated last month
- Single-Life Reinforcement Learning ★14 · Dec 17, 2022 · Updated 3 years ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ★15 · Jul 16, 2025 · Updated 9 months ago
- Reproduction of the paper "Continual Vision-Language Representation Learning with Off-Diagonal Information" (Mod-X) ★11 · Oct 31, 2023 · Updated 2 years ago
- ★14 · Mar 5, 2024 · Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ★54 · Feb 23, 2024 · Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning" ★11 · Jan 10, 2025 · Updated last year
- Re-implementations of SOTA RL algorithms. ★137 · Sep 7, 2023 · Updated 2 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models" ★14 · Mar 25, 2025 · Updated last year
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions ★14 · Nov 23, 2025 · Updated 5 months ago
- Implementation of AdaCQR (COLING 2025) ★15 · Dec 30, 2024 · Updated last year
- ★12 · May 14, 2024 · Updated last year
- The code of "Deep Regression Representation Learning with Topology" in ICML 2024 ★14 · Jul 4, 2024 · Updated last year
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ★51 · Jul 1, 2025 · Updated 10 months ago
- Data and code for Emotion Prediction Errors ★10 · Feb 22, 2022 · Updated 4 years ago
- SOTA work about out-of-distribution detection ★14 · Mar 5, 2021 · Updated 5 years ago
- "A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020) ★11 · Apr 26, 2021 · Updated 5 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling… ★12 · Jan 25, 2019 · Updated 7 years ago
- Official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ★73 · Apr 2, 2025 · Updated last year
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference" ★43 · Jul 10, 2024 · Updated last year
- ★14 · Jan 4, 2025 · Updated last year
- Source code for the paper "Generative Flow Network for Listwise Recommendation" ★17 · Nov 8, 2024 · Updated last year
- ★16 · Nov 1, 2023 · Updated 2 years ago
- A beautiful weather visualization JavaScript library ★17 · Apr 26, 2021 · Updated 5 years ago
- ★20 · Nov 3, 2024 · Updated last year
- ★13 · Sep 26, 2024 · Updated last year
- ★11 · Oct 22, 2024 · Updated last year
- Source code for the AAMAS 2023 Imperfect-information Card Game Competition ★13 · Mar 21, 2024 · Updated 2 years ago