dojeon-ai / SimTPRLinks
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12Updated 2 years ago
Alternatives and similar repositories for SimTPR
Users that are interested in SimTPR are comparing it to the libraries listed below
Sorting:
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Updated last year
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆20Updated last year
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆33Updated 4 months ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆20Updated 10 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆70Updated last year
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Updated last year
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"☆17Updated last year
- Code for the paper "Multi-scale Diffusion Denoised Smoothing" (NeurIPS 2023)☆14Updated last year
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)☆11Updated 4 years ago
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)☆23Updated 2 weeks ago
- RL Implementation☆19Updated 3 years ago
- ☆13Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 9 months ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆32Updated 10 months ago
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning☆12Updated 4 months ago
- ☆41Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆43Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Distributed Priortized Experience Replay☆10Updated 6 years ago
- ☆38Updated last year
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆11Updated 7 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 11 months ago
- The git repository of Modular Prompted Chatbot paper☆34Updated 2 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆10Updated 7 months ago
- ☆18Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆29Updated last year
- RAD: Reinforcement Learning with Augmented Data (code for state augmentation)☆11Updated 4 years ago
- ☆70Updated last week