MVE: model-based value estimation
☆11Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for mve
Users that are interested in mve are comparing it to the libraries listed below
Sorting:
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"☆12Apr 28, 2021Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019☆20Feb 25, 2020Updated 6 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆185Apr 12, 2022Updated 3 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆31Jun 15, 2020Updated 5 years ago
- PyTorch IMPALA implementation☆27Aug 31, 2019Updated 6 years ago
- My Body Is A Cage☆41Apr 13, 2021Updated 4 years ago
- Bayesian Reinforcement Learning in Tensorflow☆335Feb 15, 2021Updated 5 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- ☆12Jan 31, 2023Updated 3 years ago
- In this project, we give python and C++ codes for the Ring Polymer Molecular Dynamics (RMPD) to calculate the time correlation function(…☆12Dec 31, 2017Updated 8 years ago
- Implementing TDMA like MAC protocol with IEEE 802.15.4 PHY on GNURadio☆11May 21, 2019Updated 6 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- Implementation of Oridinal Classification Paper using Logistic Regression and SVM☆12Jun 10, 2017Updated 8 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Nov 14, 2024Updated last year
- ☆12May 2, 2022Updated 3 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- gridslam from OpenSLAM.org☆13May 15, 2018Updated 7 years ago
- Patient data simulator following the structure of an open-ai gym.☆11Jul 9, 2019Updated 6 years ago
- Simple RNN test with TensorFLow☆13May 22, 2018Updated 7 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Submission Under Review☆17May 15, 2025Updated 9 months ago
- ☆10Jan 29, 2021Updated 5 years ago
- Our first-year mathematics graduate school notes☆10Dec 20, 2021Updated 4 years ago
- Open AI Gym environment of the Missile Command Atari game.☆14May 23, 2023Updated 2 years ago
- SDK for Unitree A1, Co-Working with Wego Robotics☆10Feb 20, 2022Updated 4 years ago
- This is a tutorial of using Kubeflow to build model, train model and deploy model serving.☆14Nov 22, 2022Updated 3 years ago
- Simulation of 802.11 DCF MAC protocol and 802.11 with RTS/CTS☆12Nov 21, 2017Updated 8 years ago
- Visualize subsets of PSL(2,R) in exterior solid torus model☆13Aug 20, 2020Updated 5 years ago
- ☆11Apr 26, 2021Updated 4 years ago
- Explore Fibonacci, Galois, and State Space Linear Feedback Shift Register (LFSR) sequence generators☆12Dec 29, 2020Updated 5 years ago
- Code associated with the project http://predimportance.mit.edu/☆12Aug 7, 2020Updated 5 years ago
- Solving the card game 6 nimmt! with reinforcement learning☆14Dec 31, 2021Updated 4 years ago
- ☆11Dec 23, 2025Updated 2 months ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago