Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆22Aug 1, 2021Updated 4 years ago
Alternatives and similar repositories for CODAC
Users that are interested in CODAC are comparing it to the libraries listed below
Sorting:
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- Official Github Repository for "Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees". (NeurIPS 2024)☆11Nov 30, 2025Updated 3 months ago
- ☆11Aug 2, 2022Updated 3 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 2 years ago
- Implementation of Robust Adversarial Reinforcement Learning☆14Nov 27, 2017Updated 8 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- ☆16Oct 5, 2021Updated 4 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Feb 27, 2023Updated 3 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Official implementation of the paper: Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate☆25Dec 4, 2023Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- ☆28Jan 11, 2021Updated 5 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 4 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Mar 24, 2023Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Jul 27, 2021Updated 4 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- Sparse Graphical Memory for Robust Planning☆29Nov 21, 2022Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- Tensorflow implementation for Robust Adversarial Reinforcement Learning: https://arxiv.org/pdf/1703.02702.pdf☆28Mar 7, 2018Updated 7 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆188Jul 25, 2024Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Jun 2, 2023Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.☆37Mar 29, 2022Updated 3 years ago
- A PyTorch implementation of Conditional PixelCNNs☆27Jan 24, 2018Updated 8 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆36Oct 19, 2023Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Dec 7, 2024Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,052May 23, 2024Updated last year
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆140Nov 16, 2021Updated 4 years ago
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆32Jan 21, 2022Updated 4 years ago