Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆27Aug 21, 2024Updated last year
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Oct 3, 2024Updated last year
- ☆36Jun 10, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- 【IJSR】Crowd-comfort Robot Navigation among Dynamic Environment Based on social-stressed deep reinforcement learning☆11Dec 1, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆16Dec 18, 2025Updated 6 months ago
- Personalized Story Evaluation Model☆17Nov 27, 2023Updated 2 years ago
- code for project llm-personalize☆19Aug 9, 2024Updated last year
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- The source code of ExFunTube☆10Aug 8, 2025Updated 10 months ago
- [NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models☆29May 8, 2026Updated last month
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated last year
- ☆15May 25, 2026Updated last month
- CLAIR: A (surprisingly) simple semantic text metric with large language models.☆22Jan 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Customer simulation for direct marketing experiments☆20Jul 9, 2021Updated 4 years ago
- Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View☆10May 17, 2022Updated 4 years ago
- Personalized Graph-based Retrieval for LLMs Benchmark☆34Feb 16, 2025Updated last year
- ☆39Oct 1, 2025Updated 9 months ago
- Code to reproduce results of our experiments using LoRe☆18Jun 10, 2026Updated 3 weeks ago
- Faithfully Explainable Recommendation via Neural Logic Reasoning☆16May 3, 2021Updated 5 years ago
- [CIKM'23] Test Time Embedding Normalization for Popularity Bias Mitigation☆15Jan 9, 2024Updated 2 years ago
- ☆10Jul 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆123May 2, 2024Updated 2 years ago
- [NeurIPS 2025] Continual Multimodal Contrastive Learning☆28Dec 18, 2025Updated 6 months ago
- The implementation of paper "Strategy-aware Bundle Recommender System", SIGIR'23.☆15Sep 4, 2023Updated 2 years ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements☆25Sep 14, 2024Updated last year
- Code implementation of "Information Design in Multi-Agent Reinforcement Learning"☆16Aug 18, 2023Updated 2 years ago
- PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.☆20Jan 8, 2025Updated last year
- DependEval: a hierarchical benchmark for evaluating LLMs on repository-level code understanding across 8 programming languages.☆16Jul 28, 2025Updated 11 months ago
- Taskmate - an open source grading desktop application with synchronisation capabilities for Windows and MacOS☆14May 29, 2023Updated 3 years ago
- ☆17Oct 13, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Apr 11, 2021Updated 5 years ago
- [IROS2023]Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated 2 years ago
- ☆13Aug 27, 2021Updated 4 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".☆12Oct 1, 2017Updated 8 years ago
- ☆18Nov 25, 2024Updated last year
- ☆31Jun 16, 2026Updated 2 weeks ago
- Conformal Bayes with importance sampling☆23Oct 25, 2021Updated 4 years ago