This repo shows how a simple reinforcement learning method can beat the Lunar Lander environment. The agent combines the cross-entropy method (CEM) with neural networks built with the PyTorch library.
☆18 · Jul 27, 2018 · Updated 7 years ago
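As a rough illustration of the idea behind CEM, here is a minimal sketch that is not taken from the repo. It assumes the parameter-space variant of the cross-entropy method (sample candidates from a Gaussian, keep the best, refit the Gaussian) and uses only the Python standard library. The names `cem_maximize`, `toy_score`, and `TARGET` are hypothetical, and the toy objective stands in for the episode return that a Lunar Lander agent with a PyTorch policy network would produce.

```python
import random
import statistics


def cem_maximize(score, dim, iterations=40, population=100,
                 elite_frac=0.2, init_std=3.0, seed=0):
    """Cross-entropy method over a diagonal Gaussian in parameter space."""
    rng = random.Random(seed)
    mean = [0.0] * dim
    std = [init_std] * dim
    n_elite = max(2, int(population * elite_frac))
    for _ in range(iterations):
        # Sample candidate parameter vectors from the current Gaussian.
        samples = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(population)]
        # Keep the highest-scoring candidates (the "elite" set).
        elite = sorted(samples, key=score, reverse=True)[:n_elite]
        # Refit the Gaussian to the elite set, one dimension at a time.
        columns = list(zip(*elite))
        mean = [statistics.mean(c) for c in columns]
        # A small floor keeps the standard deviation from reaching zero.
        std = [statistics.pstdev(c) + 1e-3 for c in columns]
    return mean


# Hypothetical stand-in for an episode return: higher is better,
# with the maximum at TARGET.
TARGET = [1.0, -2.0, 0.5]


def toy_score(params):
    return -sum((p - t) ** 2 for p, t in zip(params, TARGET))


best = cem_maximize(toy_score, dim=len(TARGET))
print(best)
```

In an actual agent, `score` would run one or more episodes with the candidate parameters loaded into the policy network and return the total reward. Some CEM tutorials instead keep the elite episodes and train the network on their observation/action pairs, and the description above does not say which variant the repo uses.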
Alternatives and similar repositories for Landing-A-Rocket-With-Simple-Reinforcement-Learning
Users who are interested in Landing-A-Rocket-With-Simple-Reinforcement-Learning are comparing it to the libraries listed below.
- Workshop materials for the workshop "Computer Science Crash Course for Python Hackers" at PyBay 2017 ☆17 · Aug 10, 2017 · Updated 8 years ago
- A small test for multithreaded C++ stack unwinding on unixes ☆16 · Feb 24, 2020 · Updated 6 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn. ☆18 · Jul 28, 2018 · Updated 7 years ago
- Deep Q-Network (DQN) to play classic Atari Games ☆11 · Sep 18, 2017 · Updated 8 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub. ☆55 · Dec 6, 2018 · Updated 7 years ago
- Common Lisp Implementation of NeuroEvolution of Augmenting Topologies (NEAT) ☆13 · Dec 7, 2016 · Updated 9 years ago
- A research project on isomorphisms of finite fields ☆16 · Jun 15, 2018 · Updated 7 years ago
- Fast MRI reconstruction on CUDA GPUs ☆10 · Dec 30, 2023 · Updated 2 years ago
- Commandline utility for OSX that reloads the frontmost browser tab ☆11 · Jan 18, 2016 · Updated 10 years ago
- A lightweight interactive data visualization library ☆14 · Apr 25, 2019 · Updated 6 years ago
- A simple APL neural network. ☆11 · May 11, 2016 · Updated 9 years ago
- A minimal regression library for Julia ☆12 · Apr 24, 2018 · Updated 7 years ago
- Farcaster-feed is a Farcaster protocol syndication tool for Node.js ☆15 · Sep 28, 2022 · Updated 3 years ago
- ResearchDoom fork of the Chocolate Doom engine. ☆16 · Oct 20, 2017 · Updated 8 years ago
- Create realtime apps on top of GitHub ☆12 · Dec 25, 2017 · Updated 8 years ago
- A copy of the verifiable computation projects from Microsoft Research, Pinnochio and Gepetto ☆18 · Sep 17, 2019 · Updated 6 years ago
- JIT compiler of LPeg patterns ☆18 · Oct 4, 2015 · Updated 10 years ago
- Nearly generic prime field implementation in Go ☆24 · Feb 11, 2020 · Updated 6 years ago
- We implement MADDPG in a congestion env, and compare with several control groups to highlight the performance of MADDPG ☆11 · Jul 14, 2021 · Updated 4 years ago
- 🎨📊 5,623 Street art photos of 15 artists ☆16 · Sep 10, 2018 · Updated 7 years ago
- React Amplitude Analytics ☆11 · Nov 5, 2018 · Updated 7 years ago
- Computer Vision Models ☆12 · Mar 1, 2023 · Updated 3 years ago
- Repo for machine learning research paper summaries ☆10 · Jun 26, 2018 · Updated 7 years ago
- Shared autonomy via deep reinforcement learning ☆80 · Mar 24, 2023 · Updated 3 years ago
- Find strongest response of convolutional layers on an image dataset. Automatically compute receptive field for any CNN layer. ☆14 · Feb 19, 2021 · Updated 5 years ago
- My computational narrative notebooks. ☆10 · Aug 13, 2018 · Updated 7 years ago
- Finally a free iMessage scheduler that just works. ☆16 · Dec 6, 2019 · Updated 6 years ago
- Routing with reinforcement learning ☆10 · Apr 9, 2022 · Updated 4 years ago
- A (mildly) optimizing brainf*ck compiler implemented as Nim macros ☆26 · Apr 9, 2026 · Updated last week
- Homework 3 for Berkeley CS 280: our version of the MIT Mini Places challenge ☆12 · Mar 5, 2016 · Updated 10 years ago
- Convert MUSE from TensorFlow to PyTorch and ONNX ☆11 · May 22, 2024 · Updated last year
- Face generation from a given extremely low resolution images using DC_GAN. ☆12 · May 15, 2017 · Updated 8 years ago
- ☆13 · Nov 17, 2015 · Updated 10 years ago
- This project is concerned with my participating in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE ☆13 · Jun 28, 2023 · Updated 2 years ago
- 28th place solution to Kaggle Santander Competition 2019 ☆18 · Apr 19, 2019 · Updated 7 years ago
- my configuration files ☆30 · Mar 14, 2026 · Updated last month
- Randomized Linear Algebra in Python ☆13 · Mar 21, 2017 · Updated 9 years ago
- Interface definitions for the Compute@Edge platform in witx. ☆15 · Feb 11, 2022 · Updated 4 years ago
- This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experi… ☆13 · Jan 4, 2023 · Updated 3 years ago