dojeon-ai / SimTPRLinks
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12Updated 2 years ago
Alternatives and similar repositories for SimTPR
Users that are interested in SimTPR are comparing it to the libraries listed below
Sorting:
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆20Updated 2 years ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆92Updated 2 years ago
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆35Updated 10 months ago
- Code for the paper "STRAP: A Spatio-Temporal Framework for Real Estate Apprisal" (CIKM 2023)☆14Updated 2 years ago
- RL Implementation☆19Updated 3 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆28Updated 4 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- Yet Another Reinforcement Learning Tutorial☆72Updated 2 years ago
- The git repository of Modular Prompted Chatbot paper☆35Updated 2 years ago
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)☆36Updated 4 years ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆19Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated last year
- ☆66Updated 4 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆37Updated 2 years ago
- Code for Contrastive Preference Learning (CPL)☆177Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆70Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆35Updated 4 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆57Updated last year
- Code for the paper "Multi-scale Diffusion Denoised Smoothing" (NeurIPS 2023)☆14Updated last year
- A Toolkit for Distributional Control of Generative Models☆74Updated last month
- ☆42Updated 2 years ago
- RL algorithm: Advantage induced policy alignment☆66Updated 2 years ago
- ☆14Updated 3 years ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"☆20Updated last year
- Information and Materials for the Deep Learning Course☆31Updated 3 years ago