Exploring the Dyna-Q reinforcement learning algorithm
☆17Feb 27, 2018Updated 8 years ago
Alternatives and similar repositories for dynaq
Users that are interested in dynaq are comparing it to the libraries listed below.
- Google AI Research☆10Mar 11, 2020Updated 6 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".☆12Feb 27, 2024Updated 2 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- OpenAI Gym environment for Robot Soccer Goal☆19May 17, 2019Updated 6 years ago
- ☆16Jul 1, 2021Updated 4 years ago
- Chance constraints in CVXPY☆19Jan 12, 2019Updated 7 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)☆54Jul 7, 2021Updated 4 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Sep 19, 2017Updated 8 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- ☆20Oct 23, 2020Updated 5 years ago
- ☆10Jan 5, 2018Updated 8 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- A Python implementation of the Deferred Acceptance Algorithm (one-to-one, many-to-many)☆23Jun 11, 2020Updated 5 years ago
- Train a quadcopter to fly with a deep reinforcement learning algorithm - DDPG☆12Jul 19, 2018Updated 7 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- Adds a right-click "Send to" > "Send to Kindle device" entry on Windows for one-click pushing of e-books to a Kindle device☆12Mar 29, 2019Updated 7 years ago
- Visual Verb Sense Disambiguation☆13Apr 26, 2019Updated 6 years ago
- Tools to construct surrogate models based on Hermitian polynomial bases. Includes full-factorial and sparse polynomial chaos expansions v…☆10Nov 8, 2018Updated 7 years ago
- Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"☆12Apr 28, 2021Updated 4 years ago
- PyTorch implementations of Non-parametric Unsupervised Classification with Adversarial Autoencoders☆12Apr 26, 2019Updated 6 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆35Mar 29, 2023Updated 3 years ago
- ☆19Dec 25, 2024Updated last year
- Automatically monitors, fetches and commits your accepted LeetCode submissions to your specified Git repository.☆12Feb 11, 2020Updated 6 years ago
- A PID controller in Python☆10Nov 17, 2013Updated 12 years ago
- Deep Multi-Agent Reinforcement Learning with StarCraft 2☆10Sep 27, 2020Updated 5 years ago
- Library for construction, manipulation and evaluation of factorable functions☆13Dec 13, 2025Updated 4 months ago
- ☆10Aug 26, 2022Updated 3 years ago
- ☆33Nov 13, 2023Updated 2 years ago
- Chunked download solution for large files (over 1 GB), based on Spring Boot☆15Oct 24, 2021Updated 4 years ago
- Predictor-corrector learning-based computational guidance law☆13Jun 22, 2021Updated 4 years ago
- 🔦 A minimal raytracing engine written in C on MinilibX☆10Mar 23, 2021Updated 5 years ago
- MAML implementation with pytorch☆11Sep 23, 2020Updated 5 years ago
- Simple Code Implementation of "Xception" architecture using PyTorch.☆16Mar 16, 2020Updated 6 years ago