dojeon-ai / SimTPRLinks
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12Updated 2 years ago
Alternatives and similar repositories for SimTPR
Users that are interested in SimTPR are comparing it to the libraries listed below
Sorting:
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆20Updated last year
- ☆41Updated last year
- RL Implementation☆19Updated 3 years ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Updated last year
- Code for the paper "STRAP: A Spatio-Temporal Framework for Real Estate Apprisal" (CIKM 2023)☆13Updated 2 years ago
- Yet Another Reinforcement Learning Tutorial☆73Updated 2 years ago
- ☆11Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆34Updated 6 months ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated 2 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆37Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆65Updated last year
- Information and Materials for the Deep Learning Course☆31Updated 3 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 11 months ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"☆11Updated 2 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- Code for Contrastive Preference Learning (CPL)