Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Jan 10, 2019Updated 7 years ago
Alternatives and similar repositories for reproduction-soft-qlearning-mutual-information
Users that are interested in reproduction-soft-qlearning-mutual-information are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 7 years ago
- tensorflow Implementation of https://github.com/facebookresearch/MIXER☆11Mar 8, 2017Updated 9 years ago
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆14Apr 29, 2023Updated 2 years ago
- We optimize SIEP algorithm in multiple intelligent agents scenario and comparatively research A*, DFS, BFS, Dijkstra, PFP and PRM.☆15Jul 31, 2024Updated last year
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆58Oct 18, 2022Updated 3 years ago
- Android aestheticodes app☆13Aug 27, 2025Updated 7 months ago
- Reinforcement learning framework.☆16Jul 25, 2025Updated 8 months ago
- ☆10Nov 23, 2020Updated 5 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆437Nov 28, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 使用knn和朴素贝叶斯算法预测居民出行目的地,主要基于Scala和python语言编写,运行在spark分布式集群。☆10Jun 21, 2022Updated 3 years ago
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem☆23Feb 3, 2019Updated 7 years ago
- ☆14Mar 24, 2021Updated 5 years ago
- Flock and swarm multi-agent RL training environments implemented in JAX☆14Nov 19, 2025Updated 4 months ago
- ☆16Jul 22, 2021Updated 4 years ago
- ☆16Dec 5, 2025Updated 3 months ago
- python code to download all emails from gmail (or other imap service) in either csv or json data☆12Oct 11, 2018Updated 7 years ago
- Dead simple ultra-low bandwidth video over QUIC, written in Rust, dead simple to use, free forever, designed for use in any and all condi…☆50Mar 8, 2026Updated 3 weeks ago
- Code for Jensen et al. 2023☆17Jun 25, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Using Natural Language for Reward Shaping in Reinforcement Learning☆24Dec 11, 2023Updated 2 years ago
- ☆20Oct 5, 2018Updated 7 years ago
- Utility functions for weights and biases (wandb).☆11Sep 17, 2024Updated last year
- Deep Reinforcement Learning for mobile robot navigation, a robot learns to navigate to a random goal point from random moves to adopting …☆20Jul 9, 2022Updated 3 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines☆19Jan 12, 2021Updated 5 years ago
- Simple hierarchical configuration for Python packages.☆14Jan 14, 2024Updated 2 years ago
- ☆12Mar 23, 2018Updated 8 years ago
- ☆23May 12, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and ben…☆24Nov 9, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated 11 months ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- ☆13Apr 11, 2022Updated 3 years ago