Solving CartPole with an Anticipatory network
☆22Mar 11, 2019Updated 7 years ago
Alternatives and similar repositories for CartPole
Users that are interested in CartPole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tracking books that I {have, currently, or plan to} read☆18Apr 18, 2021Updated 4 years ago
- This project includes various scripts for Ensage.☆11Jan 5, 2015Updated 11 years ago
- 数据预处理——插值法填补缺失值,并且标记填充位置☆10Apr 19, 2019Updated 6 years ago
- Undergraduate Thesis.☆11Apr 13, 2025Updated 11 months ago
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Apr 15, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- repo for the MPC-RL summer school project☆15Nov 5, 2022Updated 3 years ago
- A framework for path-planing and obstacle avoidance using Deep Reinforcement Learning Techniques.☆15Oct 4, 2021Updated 4 years ago
- A tensorflow implementation of hindsight experience replay☆17Apr 19, 2018Updated 7 years ago
- Python 3.6 and TensorFlow implementation of the AReS and MaRS algorithms☆11Jun 23, 2019Updated 6 years ago
- ☆14Dec 10, 2017Updated 8 years ago
- OpenAI Gym Wrapper for DeepMind Control Suite☆74Nov 30, 2021Updated 4 years ago
- ☆19May 19, 2021Updated 4 years ago
- Deep Q-learning approach to OpenAI Gym's Lunar Lander☆15Jul 27, 2017Updated 8 years ago
- Tensorflow implementation of a Deep Deterministic Policy Gradient (DDPG) network, trained on OpenAI Gym environments.☆23Nov 8, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 用强化学习来玩微信跳一跳☆20Jan 15, 2018Updated 8 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- 基于深度强化学习DQN的FlappyBird游戏AI开发☆16Aug 12, 2019Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- 人工智能导论课程设计-用强化学习玩FlappyBird☆18Mar 25, 2020Updated 6 years ago
- CartPole-v0 via PPO with GAE, PyTorch☆21Feb 10, 2019Updated 7 years ago
- Course resource for the optimization algorithm course in spring 2021. The course is lectured by prof. Zhouwang Yang.☆16Jan 26, 2022Updated 4 years ago
- Library that provides environments for planning problems☆16Mar 30, 2026Updated last week
- ☆15Dec 12, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- ☆18Nov 28, 2017Updated 8 years ago
- ☆20Mar 15, 2017Updated 9 years ago
- Simple lua-python parser☆11Jan 19, 2018Updated 8 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- There are five robots in a consensus formation. They can communicate with each other by a communication topology and correct their positi…☆26May 28, 2023Updated 2 years ago
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago
- grid world reinforcement learning for tensorflow js☆20Jun 18, 2018Updated 7 years ago
- Reinforcement Learning-based Mobile Robot Navigation☆24Oct 31, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆26Mar 29, 2025Updated last year
- CTC beam search☆12Oct 26, 2016Updated 9 years ago
- Matlab/Octave implementation of Reinforcement learning (Q learning algorithm).☆24May 8, 2019Updated 6 years ago
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation☆40Dec 3, 2023Updated 2 years ago
- A SFSpeechRecognizer-based voice recordings transcriber for macOS☆26Oct 31, 2022Updated 3 years ago
- 利用强化学习方法 DQN 生成基于机器学习的恶意流量检测模型☆29Oct 27, 2021Updated 4 years ago
- Automatic Gap-Fill Question Generation☆18May 30, 2024Updated last year