☆47Aug 29, 2019Updated 6 years ago
Alternatives and similar repositories for Reinforcement-Learning-Lunar_Lander
Users that are interested in Reinforcement-Learning-Lunar_Lander are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Q-learning approach to OpenAI Gym's Lunar Lander☆15Jul 27, 2017Updated 8 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆10Oct 3, 2023Updated 2 years ago
- Training an LSTM network on the Penn Tree Bank (PTB) dataset☆11Nov 5, 2018Updated 7 years ago
- ☆16Jan 16, 2025Updated last year
- Repository for the Udacity Deep Reinforcement Learning Nanodegree☆12Jul 9, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Optimal placement of edge servers using K-means Clustering and Power allocation using Particle Swarm Optimization☆13Nov 22, 2021Updated 4 years ago
- A multi-agent version of the Double DQN algorithm, with Foraging Task and Pursuit Game test scenarios☆12Apr 24, 2017Updated 8 years ago
- ☆11Oct 26, 2022Updated 3 years ago
- Source code for our paper "BLOB: a probabilistic model for recommendation that combines organic and bandit signals" published at KDD 2020…☆16Mar 24, 2023Updated 3 years ago
- This repo gives an example of using a simple method of reinforcement learning to beat the Lunar Lander environment. The agent uses a comb…☆18Jul 27, 2018Updated 7 years ago
- Use DeepMIMO dataset to generate samples for wireless power allocation☆11Feb 3, 2021Updated 5 years ago
- Learning Continuous Control in Deep Reinforcement Learning☆14Nov 24, 2018Updated 7 years ago
- Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' env…☆13Nov 14, 2021Updated 4 years ago
- Implementation of reinforcement learning algorithms for the OpenAI Gym environment LunarLander-v2☆24Mar 8, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Using Gradio interface to build UI for converting text to speech☆13Jan 26, 2021Updated 5 years ago
- ☆11Apr 21, 2023Updated 2 years ago
- 基于深度强化学习DQN的FlappyBird游戏AI开发☆16Aug 12, 2019Updated 6 years ago
- This repo contains PPO implementation in PyTorch for LunarLander-v2☆11Jun 26, 2020Updated 5 years ago
- OpenAI LunarLander-v2 DeepRL-based solutions (DQN, DuelingDQN, D3QN)☆43Aug 11, 2021Updated 4 years ago
- IERG 6130 Reinforcement Learning☆10Apr 29, 2019Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago
- 人工智能导论课程设计-用强化学习玩FlappyBird☆18Mar 25, 2020Updated 6 years ago
- Flappy Bird as a Farama Gymnasium environment.☆37Aug 1, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Mar 3, 2025Updated last year
- Open AI Gym - Pendulum-v1 reinforcement learning (DQN, SAC)☆21Jan 26, 2024Updated 2 years ago
- ☆18Jul 20, 2023Updated 2 years ago
- Collection of reinforcement learning algorithms implementations with TensorFlow2☆14Sep 28, 2024Updated last year
- ☆14May 18, 2018Updated 7 years ago
- The continuous mountain car problem solved with DDPG☆13Apr 19, 2020Updated 6 years ago
- Mobility-aware Dynamic Joint Power Control and Resource Allocation for D2D underlaying cellular networks☆14Sep 6, 2020Updated 5 years ago
- An improved version of EOI on Starcraft II task so_many_baneling. (The Emergence of Individuality)☆17Oct 28, 2021Updated 4 years ago
- Python Implementation of STreeD: Dynamic Programming Approach for Optimal Decision Trees with Separable objectives and Constraints☆20Mar 23, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Blade Element Momentum Theory Function for MATLAB☆12Aug 2, 2018Updated 7 years ago
- Sensitivity Analysis library for Matlab☆17Feb 23, 2026Updated last month
- Robust and safe deep reinforcement learning algorithms☆17Mar 27, 2024Updated 2 years ago
- Propeller design tool. Minimum Induced Loss propellers: Larrabee's method☆17Apr 22, 2023Updated 2 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Gridworld domains in the gym interface☆29Oct 2, 2024Updated last year
- Code Repository for "Neural Networks for Efficient Bayesian Decoding of Natural Images from Retinal Neurons" by Nikhil Parthasarathy, Ele…☆10May 14, 2018Updated 7 years ago