CVxTz / RLLinks
RL algorithm implementations from scratch.
☆18Updated 4 years ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆39Updated 2 years ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 3 months ago
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 11 months ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆72Updated 7 months ago
- A Library for Active Preference-based Reward Learning Algorithms☆50Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆30Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆100Updated 3 years ago
- Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.☆54Updated 2 years ago
- Simple Grid Environment for Gymnasium☆59Updated 4 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29Updated last month
- ☆21Updated last year
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated 9 months ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated 2 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆41Updated 3 years ago
- The Starcraft Multi-Agent challenge lite☆42Updated 9 months ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆14Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆98Updated last month
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆50Updated 2 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆78Updated 10 months ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization