dojeon-ai / SimTPRLinks
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12Updated 2 years ago
Alternatives and similar repositories for SimTPR
Users that are interested in SimTPR are comparing it to the libraries listed below
Sorting:
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆20Updated 2 years ago
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆35Updated 9 months ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆70Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- Code for Contrastive Preference Learning (CPL)☆177Updated last year
- Yet Another Reinforcement Learning Tutorial☆72Updated 2 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆37Updated last year
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated 2 years ago
- ML2-Multi Agent Environments☆35Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- ☆19Updated last year
- Deep-RL algorithm Implementations using Pytorch☆16Updated 2 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆20Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆33Updated 2 years ago
- RL Implementation☆19Updated 3 years ago
- Rewarded soups official implementation☆62Updated 2 years ago
- ☆18Updated 2 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆28Updated 4 years ago
- ☆16Updated 2 years ago
- Distributed Priortized Experience Replay☆10Updated 7 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Reinforcement Learning via Regressing Relative Rewards☆38Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆15Updated 3 months ago
- Code for the paper "Multi-scale Diffusion Denoised Smoothing" (NeurIPS 2023)☆14Updated last year
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆51Updated last year
- Code for the paper "STRAP: A Spatio-Temporal Framework for Real Estate Apprisal" (CIKM 2023)☆14Updated 2 years ago
- Meta-Learning with Self-Improving Momentum Target (NeurIPS 2022)☆23Updated 3 years ago