Implementation of TD Lambda algorithm, with neural network for value estimation
☆20Apr 16, 2018Updated 7 years ago
Alternatives and similar repositories for Deep-Watkins-Q-and-Actor-Critic
Users that are interested in Deep-Watkins-Q-and-Actor-Critic are comparing it to the libraries listed below
Sorting:
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 10 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- ☆31Oct 24, 2023Updated 2 years ago
- A neural network with 3 layers made with just numpy as dependency☆26Jun 5, 2017Updated 8 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆14Dec 11, 2021Updated 4 years ago
- ☆10Aug 15, 2016Updated 9 years ago
- ☆11Jul 25, 2021Updated 4 years ago
- This repo contains a PyTorch implementation of a CNN model for multi-label Image classification model deployed on heroku.☆14Feb 28, 2021Updated 5 years ago
- my pdf files☆12Aug 7, 2019Updated 6 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Implement the model of Halperin and Feldshteyn for DJIA and SP500☆10Apr 4, 2019Updated 6 years ago
- Playground for motion planning and controls algorithms.☆15Aug 15, 2018Updated 7 years ago
- ☆18Dec 11, 2015Updated 10 years ago
- SEC Form 13f Securities datasets☆12Apr 19, 2019Updated 6 years ago
- tweaks to Spyder IDE to include Solarized color scheme☆24Apr 15, 2020Updated 5 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Mar 29, 2021Updated 4 years ago
- This repository provides several functions to generate and process race track maps containing specific local information, which is furthe…☆13Dec 2, 2021Updated 4 years ago
- MPC package for solving optimal control problems☆19Jun 11, 2025Updated 9 months ago
- Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.☆22Oct 12, 2023Updated 2 years ago
- An PPO - LSTM based RL agent to solve the classic word game - Hangman☆15Nov 20, 2024Updated last year
- BankHoldingCompanyData☆13Mar 11, 2026Updated last week
- Object-Aware Guidance for Autonomous Scene Reconstruction☆17Aug 19, 2018Updated 7 years ago
- Code from the paper An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent …☆14Mar 20, 2024Updated 2 years ago
- MXNet Implementation of DeepMind's Neural Arithmetic Logic Units (NALU)☆18Aug 10, 2018Updated 7 years ago
- Simulation of RRT* algorithms with and without Dubins Nonholonomic Robot steering.☆68Nov 15, 2017Updated 8 years ago
- Hardware design files for BLDC servo controller☆14Sep 24, 2022Updated 3 years ago
- Python API and analysis of Chicago's bikeshare☆10Dec 8, 2022Updated 3 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- WOS(web of science)网站文献爬取工具☆19Sep 7, 2018Updated 7 years ago
- Noisy network measurement with stan☆56Jan 16, 2026Updated 2 months ago
- Autonomously drifting car that learned how to drift in a roundabout using Deep Reinforcement Learning and the Carla Simulator. Performed …☆17Oct 21, 2024Updated last year
- This is the code for "DeepMind Reinforcement Learning" By Siraj Raval on Youtube☆85Sep 5, 2018Updated 7 years ago
- A Higher-order HMM with EM algo.☆16May 4, 2022Updated 3 years ago
- Like word2vec, except for letters of the alphabet.☆17May 29, 2017Updated 8 years ago
- A genetic algorithm that learns to play the game Qwixx☆14Mar 1, 2024Updated 2 years ago
- GPU Programming with Python and CUDA.☆26Updated this week
- ☆20Mar 28, 2023Updated 2 years ago
- Implementation of RRT, RRT-connect, RRT*, and PRM in c++☆13Oct 25, 2017Updated 8 years ago
- A Telegram bot for the BoardGameGeek☆13Nov 30, 2025Updated 3 months ago