Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
Alternatives and similar repositories for BEAR
Users that are interested in BEAR are comparing it to the libraries listed below
Sorting:
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- ☆15Jan 20, 2020Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- An adjustive SEIR model to estimate parameters of 2019-nCoV☆19Jun 22, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- TensorFlow implementation for our paper "Exploration via Hindsight Goal Generation"☆23Mar 11, 2022Updated 3 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 6 years ago
- Controllable Multi-Objective Re-ranking with Policy Hypernetworks (KDD 2023)☆38Oct 6, 2024Updated last year
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆13Aug 15, 2023Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- python implementation of the TPGR☆40Mar 27, 2019Updated 6 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- ☆12Oct 11, 2022Updated 3 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org)☆11Apr 30, 2018Updated 7 years ago
- PESLA - TORCS Deep Reinforcement Learning Agent☆10Oct 20, 2019Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Cross-entropy method variants for optimization in Julia☆12Apr 29, 2021Updated 4 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.or…☆41Oct 11, 2023Updated 2 years ago
- Notebooks for the HE introduction☆10Sep 11, 2020Updated 5 years ago
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- Dynamic Measurement Scheduling for Event Forecasting using Deep RL (ICML 2019)☆10Jun 16, 2020Updated 5 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Code for building and experimenting on saliency maps for RL agents.☆12Feb 13, 2020Updated 6 years ago
- Lightweight simulator of a roomba-like robot☆13Nov 30, 2022Updated 3 years ago
- ☆13Jul 3, 2022Updated 3 years ago
- some NDK sample☆11Mar 11, 2018Updated 7 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- This is the official implementation for IJCAI 2023 Paper: Towards Hierarchical Policy Learning for Conversational Recommendation with Hyp…☆12Sep 19, 2023Updated 2 years ago
- Reinforcement learning algorithm implementation☆10Oct 31, 2021Updated 4 years ago
- ☆15May 24, 2021Updated 4 years ago