Implementations of Deep Reinforcement Learning Algorithms and Bench-marking with PyTorch
☆160Mar 9, 2020Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jan 16, 2025Updated last year
- ☆12Mar 28, 2023Updated 3 years ago
- [UR 2023] Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment☆22Jun 19, 2024Updated 2 years ago
- Active Ragdoll Training with Unity ML-Agents☆77Dec 14, 2021Updated 4 years ago
- My implementation of a deep q learning network learning to play pong.☆10Jan 26, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Jan 3, 2022Updated 4 years ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.☆11May 29, 2023Updated 3 years ago
- Course site for UM Introduction to Autonomous Robotics at the University of Michigan☆18Mar 31, 2025Updated last year
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Python package for linear combination of independent noncentral chi-squared random variables.☆11Sep 16, 2020Updated 5 years ago
- Source code for our paper "BLOB: a probabilistic model for recommendation that combines organic and bandit signals" published at KDD 2020…☆16Mar 24, 2023Updated 3 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆94Mar 4, 2023Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)☆17Aug 2, 2024Updated last year
- Binary Programming Formulation for Learning Classification Trees Using Cplex☆12Nov 14, 2018Updated 7 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- Learning Continuous Control in Deep Reinforcement Learning☆14Nov 24, 2018Updated 7 years ago
- ☆39Aug 25, 2025Updated 10 months ago
- A beginner friendly example for Unity's ML-Agents Framework. This project teaches you how to train an A.I. via Machine Learning.☆30Jun 7, 2020Updated 6 years ago
- Implementation in PyTorch of the neural network presented in Mode-Adaptive Neural Networks for Quadruped Motion Control☆15Feb 17, 2020Updated 6 years ago
- Deep Learning - Visual Representation Learning by solving Jigsaw puzzles using Deep Reinforcement Learning☆10Dec 8, 2016Updated 9 years ago
- A paper list of sample-efficient reinforcement learning☆20Jan 12, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Mar 17, 2024Updated 2 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Tuning the PI controller parameters by using a contextual bandit approach☆15Jan 13, 2022Updated 4 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- ☆12Dec 16, 2020Updated 5 years ago
- OpenAI Gym Environments for the Application of Reinforcement Learning in the Simulation of Wireless Networked Feedback Control Loops☆15Feb 5, 2021Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Jul 3, 2018Updated 8 years ago
- ☆11Mar 31, 2020Updated 6 years ago
- [KI'22] Official implementation of the paper "Solving the Traveling Salesperson Problem with Precedence Constraints (TSPPC) by Deep Reinf…☆13Sep 19, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reinforcement learning with VizDoom platform☆13Apr 18, 2022Updated 4 years ago
- ROS simulation of a UR5 robot to pick objects and inverse kinematics implementation in Python. Includes notebooks on Inverse Kinematics t…☆16Jul 2, 2021Updated 5 years ago
- ☆17Apr 7, 2025Updated last year
- CMU Masters Thesis Project: UAV Path Planning and Human Trajectory Prediction for Navigation through Work Sites.☆11May 4, 2021Updated 5 years ago
- Code and files from a project regarding UAV path planning in a SAR situation. The project was done for the 8th semester of the Operations…☆11Dec 8, 2021Updated 4 years ago
- This projects addresses unmanned aerial vehicle (UAV) navigation and path planning under engine-out case for landing under severe weather…☆15Feb 26, 2026Updated 4 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆43Sep 18, 2025Updated 9 months ago