Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
☆44Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for Prompt-OIRL
Users that are interested in Prompt-OIRL are comparing it to the libraries listed below.
- ☆14Oct 11, 2023Updated 2 years ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆21May 15, 2025Updated 11 months ago
- Meta RL codebase for Unstable Baselines☆22Dec 6, 2022Updated 3 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Dec 8, 2024Updated last year
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi…☆39Nov 9, 2025Updated 5 months ago
- Repository of IPBench☆20Apr 6, 2026Updated last week
- ☆15Nov 19, 2021Updated 4 years ago
- ☆40Apr 6, 2026Updated last week
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 9 months ago
- Reproduction of the paper "Continual Vision-Language Representation Learning with Off-Diagonal Information" (Mod-X)☆11Oct 31, 2023Updated 2 years ago
- ☆34Jan 15, 2026Updated 3 months ago
- ☆14Mar 5, 2024Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- ☆13Aug 12, 2022Updated 3 years ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated last year
- Implementation of AdaCQR(COLING 2025)☆15Dec 30, 2024Updated last year
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆51Jul 1, 2025Updated 9 months ago
- Data and code for Emotion Prediction Errors☆10Feb 22, 2022Updated 4 years ago
- "A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)☆11Apr 26, 2021Updated 4 years ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆73Apr 2, 2025Updated last year
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆43Jul 10, 2024Updated last year
- ☆14Jan 4, 2025Updated last year
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆22Jul 8, 2024Updated last year
- ☆16Nov 1, 2023Updated 2 years ago
- A beautiful weather visualization Javascript library ☀🌤☁🌧🌨☆17Apr 26, 2021Updated 4 years ago
- ☆20Nov 3, 2024Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Mar 21, 2024Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Aug 20, 2024Updated last year
- A complete Scrapy crawler example that scrapes stock and news data☆15Aug 15, 2020Updated 5 years ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆85Mar 14, 2026Updated last month
- CLI tool for interacting with ChatGPT using terminal☆12Jan 28, 2026Updated 2 months ago
- A sample omniverse extension demonstrating omni.ui.scene API☆12Sep 10, 2022Updated 3 years ago
- ☆42Feb 2, 2024Updated 2 years ago