WEIRDLabUW / vpl_llmLinks
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆20Updated 10 months ago
Alternatives and similar repositories for vpl_llm
Users that are interested in vpl_llm are comparing it to the libraries listed below
Sorting:
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 9 months ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆29Updated last year
- Reinforcement Learning via Regressing Relative Rewards☆34Updated 6 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- ☆17Updated last year
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 11 months ago
- Generalised UDRL☆37Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆58Updated 8 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆17Updated 3 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 4 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated last year
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆42Updated 11 months ago
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆24Updated last year
- Clean, extensible implementation of MACAW [ICML 2021]☆12Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆17Updated 2 years ago
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear…☆53Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 10 months ago
- A PyTorch Implementation of Skipper☆28Updated 9 months ago
- Representation Learning in RL☆15Updated 3 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 11 months ago