This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆32Jun 5, 2019Updated 6 years ago
Alternatives and similar repositories for dyna-gym
Users that are interested in dyna-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- Neural Network learns to land a rocket using Pytorch, Unity's MLAgents and PPO.☆36Jul 23, 2019Updated 6 years ago
- Deep Q-Network (DQN) and DDPG to address the problem of stall around the wing sail of an autonomous sailing robot☆11Sep 18, 2018Updated 7 years ago
- ☆16Dec 3, 2025Updated 3 months ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An educational resource to help anyone learn deep reinforcement learning, with support for PyTorch☆17Oct 19, 2023Updated 2 years ago
- Programming in Haskell (2nd Edition)☆15Jun 8, 2017Updated 8 years ago
- Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller☆10Dec 5, 2017Updated 8 years ago
- [NeurIPS'19-Competition] Reinforcement Learning + Imitation Learning based approach to AI Driving Olympics☆26Jun 18, 2025Updated 9 months ago
- Implementation of End-To-End Memory Networks with Tensorflow for bAbI Dataset☆11Aug 17, 2017Updated 8 years ago
- Model-based Reinforcement Learning Framework☆115May 22, 2020Updated 5 years ago
- Read and write mha files using Python☆10Oct 14, 2013Updated 12 years ago
- A bottom-up model for the simulation of heat demand profiles of urban areas☆13Dec 11, 2023Updated 2 years ago
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)☆11Mar 17, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Socket programming in python☆14Aug 28, 2023Updated 2 years ago
- Pydata MAB Tutorial☆10Jul 6, 2018Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆59Aug 30, 2021Updated 4 years ago
- ☆15Aug 16, 2018Updated 7 years ago
- COBS: COmprehensive Building Simulator☆17Jun 23, 2022Updated 3 years ago
- Attempt to create a boilerplate Python package structure up-to-date tools and workflows☆15Dec 17, 2022Updated 3 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- WMG agent☆34Oct 3, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Paper reading list during my graduate studies☆17Mar 20, 2019Updated 7 years ago
- ☆25Oct 22, 2019Updated 6 years ago
- NCSU CSC-326 Course Page☆12Dec 5, 2018Updated 7 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP.☆28Dec 12, 2023Updated 2 years ago
- ☆18May 17, 2019Updated 6 years ago
- The Codebase UI that ships with UCM☆20Feb 24, 2026Updated last month
- Learning to Reinforcement Learn☆11Nov 22, 2022Updated 3 years ago
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆49Mar 20, 2026Updated last week
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Drift detection module for machine learning pipelines.☆25Jun 21, 2023Updated 2 years ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Feb 16, 2021Updated 5 years ago
- Dialogue Knowledge Transfer Networks (DiKTNet)☆24Jun 21, 2022Updated 3 years ago
- A place to store code relevant to MLOps Community Engineering Labs☆16Apr 20, 2021Updated 4 years ago
- Assignments for CS294-112.☆16Jul 13, 2018Updated 7 years ago
- Material for my workshop at PyData Berlin 2018☆14Jul 8, 2018Updated 7 years ago
- Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.☆15Feb 17, 2017Updated 9 years ago