lcalem / reproduction-soft-qlearning-mutual-informationView external linksLinks
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Jan 10, 2019Updated 7 years ago
Alternatives and similar repositories for reproduction-soft-qlearning-mutual-information
Users that are interested in reproduction-soft-qlearning-mutual-information are comparing it to the libraries listed below
Sorting:
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 7 years ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆58Oct 18, 2022Updated 3 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆435Nov 28, 2023Updated 2 years ago
- Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)☆31May 6, 2019Updated 6 years ago
- Simple hierarchical configuration for Python packages.☆14Jan 14, 2024Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Jun 26, 2016Updated 9 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- 股票高频数据(数据来源:新浪)☆13Jan 29, 2020Updated 6 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 2 months ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 2 years ago
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- ☆16Dec 5, 2025Updated 2 months ago
- ☆13Feb 22, 2023Updated 2 years ago
- ☆15Dec 15, 2025Updated 2 months ago
- ☆13Apr 11, 2022Updated 3 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- tensorflow Implementation of https://github.com/facebookresearch/MIXER☆11Mar 8, 2017Updated 8 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last week
- Android aestheticodes app☆13Aug 27, 2025Updated 5 months ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Utility functions for weights and biases (wandb).☆11Sep 17, 2024Updated last year
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- OpenAI Gym Environment for ROS.☆13Nov 1, 2017Updated 8 years ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- Code for Transformers are Adaptable Task Planners, CoRL 2022☆12Mar 28, 2023Updated 2 years ago
- ☆12Mar 12, 2024Updated last year
- Discrete entropy estimator using the Pitman-Yor mixture (PYM) prior☆18Apr 5, 2020Updated 5 years ago
- ☆10Nov 23, 2020Updated 5 years ago