Implementations of Deep Reinforcement Learning Algorithms and Benchmarking with PyTorch
☆156 · Mar 9, 2020 · Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users who are interested in Reinforcement-Learning are comparing it to the libraries listed below.
- ☆16 · Jan 16, 2025 · Updated last year
- ☆12 · Mar 28, 2023 · Updated 3 years ago
- Reinforcement learning algorithm implementation ☆10 · Oct 31, 2021 · Updated 4 years ago
- My implementation of a deep Q-learning network learning to play Pong. ☆10 · Jan 26, 2021 · Updated 5 years ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning. ☆11 · May 29, 2023 · Updated 2 years ago
- Amazon SageMaker Llama 2 Inference via Response Streaming ☆12 · Jun 28, 2024 · Updated last year
- Course site for UM Introduction to Autonomous Robotics at the University of Michigan ☆17 · Mar 31, 2025 · Updated last year
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆12 · Nov 28, 2023 · Updated 2 years ago
- Python package for linear combination of independent noncentral chi-squared random variables. ☆11 · Sep 16, 2020 · Updated 5 years ago
- Source code for our paper "BLOB: a probabilistic model for recommendation that combines organic and bandit signals" published at KDD 2020… ☆16 · Mar 24, 2023 · Updated 3 years ago
- ☆13 · Jan 14, 2020 · Updated 6 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆94 · Mar 4, 2023 · Updated 3 years ago
- [ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara) ☆17 · Aug 2, 2024 · Updated last year
- Binary Programming Formulation for Learning Classification Trees Using Cplex ☆12 · Nov 14, 2018 · Updated 7 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...) ☆17 · Jul 7, 2020 · Updated 5 years ago
- Learning Continuous Control in Deep Reinforcement Learning ☆14 · Nov 24, 2018 · Updated 7 years ago
- ☆39 · Aug 25, 2025 · Updated 9 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Oct 9, 2023 · Updated 2 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 · Sep 20, 2021 · Updated 4 years ago
- ☆11 · Jan 8, 2025 · Updated last year
- ☆12 · Mar 17, 2024 · Updated 2 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM ☆94 · Feb 8, 2021 · Updated 5 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆41 · Feb 18, 2025 · Updated last year
- IERG 6130 Reinforcement Learning ☆10 · Apr 29, 2019 · Updated 7 years ago
- PERMON main package for quadratic programming (PermonQP) ☆15 · Apr 20, 2026 · Updated last month
- ☆10 · Feb 28, 2019 · Updated 7 years ago
- 📖 Paper: Human-level control through deep reinforcement learning 🕹️ ☆57 · May 9, 2024 · Updated 2 years ago
- Tuning the PI controller parameters by using a contextual bandit approach ☆15 · Jan 13, 2022 · Updated 4 years ago
- Android aestheticodes app ☆13 · Aug 27, 2025 · Updated 8 months ago
- PyTorch implementation of two variants of the Harlow visual fixation task (PsychLab and 1D version). Reproduces the results found in two … ☆14 · Sep 2, 2020 · Updated 5 years ago
- ☆11 · Mar 31, 2020 · Updated 6 years ago
- [KI'22] Official implementation of the paper "Solving the Traveling Salesperson Problem with Precedence Constraints (TSPPC) by Deep Reinf… ☆13 · Sep 19, 2022 · Updated 3 years ago
- Python Implementation of STreeD: Dynamic Programming Approach for Optimal Decision Trees with Separable objectives and Constraints ☆20 · Mar 23, 2026 · Updated 2 months ago
- ☆17 · Apr 7, 2025 · Updated last year
- CMU Masters Thesis Project: UAV Path Planning and Human Trajectory Prediction for Navigation through Work Sites. ☆11 · May 4, 2021 · Updated 5 years ago
- ☆14 · Jun 10, 2022 · Updated 3 years ago
- ☆10 · Nov 23, 2020 · Updated 5 years ago
- Code and files from a project regarding UAV path planning in a SAR situation. The project was done for the 8th semester of the Operations… ☆11 · Dec 8, 2021 · Updated 4 years ago