TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
☆63Apr 5, 2021Updated 5 years ago
Alternatives and similar repositories for model-free-algorithms
Users that are interested in model-free-algorithms are comparing it to the libraries listed below.
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆142Sep 23, 2019Updated 6 years ago
- Distributed Rainbow-IQN for Atari☆80Dec 17, 2019Updated 6 years ago
- ☆11Feb 22, 2019Updated 7 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆296Feb 24, 2021Updated 5 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Dec 8, 2020Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆316Sep 28, 2022Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Dec 18, 2020Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆191Jul 25, 2024Updated last year
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆62Mar 2, 2021Updated 5 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,425Nov 29, 2023Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Dec 26, 2019Updated 6 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 3 years ago
- ☆18Jan 4, 2021Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆256May 3, 2020Updated 6 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆452Oct 21, 2021Updated 4 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆604Oct 28, 2020Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 5 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- TensorFlow2 Reinforcement Learning☆474Feb 13, 2022Updated 4 years ago
- Code for "Unsupervised State Representation Learning in Atari"☆259Nov 2, 2023Updated 2 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen…☆52Jun 28, 2020Updated 5 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- References at the Intersection of Causality and Reinforcement Learning☆90Aug 19, 2020Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆71Jun 5, 2020Updated 5 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 3 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 7 years ago
- ☆47Jun 19, 2018Updated 7 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Example code for PySC2☆13Jan 1, 2026Updated 4 months ago
- Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).☆52Dec 11, 2024Updated last year