Implementation of TD Lambda algorithm, with neural network for value estimation
☆20Apr 16, 2018Updated 8 years ago
Alternatives and similar repositories for Deep-Watkins-Q-and-Actor-Critic
Users that are interested in Deep-Watkins-Q-and-Actor-Critic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 10 years ago
- Adaptive stress testing of black-box systems within POMDPs.jl☆16Feb 6, 2024Updated 2 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- ☆11Jul 25, 2021Updated 4 years ago
- This repo contains a PyTorch implementation of a CNN model for multi-label Image classification model deployed on heroku.☆14Feb 28, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Mar 17, 2024Updated 2 years ago
- Dreamer on JAX☆16Jan 19, 2022Updated 4 years ago
- my pdf files☆12Aug 7, 2019Updated 6 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Implement the model of Halperin and Feldshteyn for DJIA and SP500☆10Apr 4, 2019Updated 7 years ago
- Playground for motion planning and controls algorithms.☆15Aug 15, 2018Updated 7 years ago
- Use Logitech G27 steering wheel to remote control openxc-vehicle-simulator☆13Feb 18, 2014Updated 12 years ago
- SEC Form 13f Securities datasets☆13Apr 19, 2019Updated 7 years ago
- A collection of meta-learning algorithms in Jax☆25Sep 3, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- fork of gitlab.com/bpaassen/five_clique. Solution to Matt Parker's 5-clique problem☆10Aug 4, 2022Updated 3 years ago
- An PPO - LSTM based RL agent to solve the classic word game - Hangman☆15Nov 20, 2024Updated last year
- Object-Aware Guidance for Autonomous Scene Reconstruction☆17Aug 19, 2018Updated 7 years ago
- Code from the paper An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent …☆14Mar 20, 2024Updated 2 years ago
- LSTM based neural network that predicts the state of the vehicle in terms of position and velocity.☆14May 7, 2021Updated 4 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Hardware design files for BLDC servo controller☆14Sep 24, 2022Updated 3 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- WOS(web of science)网站文献爬取工具☆19Sep 7, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- $GIT_REV in your dokku env☆15Jun 28, 2018Updated 7 years ago
- This is the code for "DeepMind Reinforcement Learning" By Siraj Raval on Youtube☆84Sep 5, 2018Updated 7 years ago
- 📻「我的收藏电台」写了个播放界面☆22Nov 8, 2022Updated 3 years ago
- Like word2vec, except for letters of the alphabet.☆17May 29, 2017Updated 8 years ago
- Documentation website for teleport generators.☆14Feb 3, 2023Updated 3 years ago
- A genetic algorithm that learns to play the game Qwixx☆14Mar 1, 2024Updated 2 years ago
- Replicated the Alpha Go Zero paper but applied it to the game Santorini.☆13Jan 27, 2018Updated 8 years ago
- GPU Programming with Python and CUDA.☆26Updated this week
- Simple AI Agent Trained to play Hangman☆13Sep 27, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Indeed web crawler☆11Aug 14, 2018Updated 7 years ago
- Reinforcement Learning via Latent State Decoding☆29Jun 12, 2023Updated 2 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆13Jul 27, 2021Updated 4 years ago
- ☆17Sep 10, 2017Updated 8 years ago
- Tools to make git easier to use and to avoid the learning curve☆20Apr 3, 2019Updated 7 years ago
- ROS Driver for Pointgrey Ladybug Cameras☆30Dec 19, 2022Updated 3 years ago
- Deep learning models for contextual multi-armed bandit setting☆13May 16, 2021Updated 4 years ago