Implementation of TD Lambda algorithm, with neural network for value estimation
☆20Apr 16, 2018Updated 8 years ago
Alternatives and similar repositories for Deep-Watkins-Q-and-Actor-Critic
Users that are interested in Deep-Watkins-Q-and-Actor-Critic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 11 years ago
- Adaptive stress testing of black-box systems within POMDPs.jl☆16Feb 6, 2024Updated 2 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- ☆31Oct 24, 2023Updated 2 years ago
- A neural network with 3 layers made with just numpy as dependency☆26Jun 5, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆14Dec 11, 2021Updated 4 years ago
- ☆10Aug 15, 2016Updated 9 years ago
- This repo contains a PyTorch implementation of a CNN model for multi-label Image classification model deployed on heroku.☆14Feb 28, 2021Updated 5 years ago
- ☆11Feb 13, 2021Updated 5 years ago
- Python code to automatically produce a summary of a piece of text.☆11Sep 8, 2016Updated 9 years ago
- 关于Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee这篇论文的详细代码解读☆11Dec 27, 2023Updated 2 years ago
- Dreamer on JAX☆16Jan 19, 2022Updated 4 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Implement the model of Halperin and Feldshteyn for DJIA and SP500☆10Apr 4, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆18Dec 11, 2015Updated 10 years ago
- SEC Form 13f Securities datasets☆14Apr 19, 2019Updated 7 years ago
- A classifier for cat and dog images (Response to Siraj's challenge of the week)☆38Feb 25, 2017Updated 9 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆19Mar 29, 2021Updated 5 years ago
- This repository provides several functions to generate and process race track maps containing specific local information, which is furthe…☆13Dec 2, 2021Updated 4 years ago
- MPC package for solving optimal control problems☆19Jun 11, 2025Updated last year
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- An PPO - LSTM based RL agent to solve the classic word game - Hangman☆15Nov 20, 2024Updated last year
- BankHoldingCompanyData☆14Mar 11, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- NLP Resources for Indian Languages☆10Nov 9, 2020Updated 5 years ago
- Code from the paper An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent …☆14Mar 20, 2024Updated 2 years ago
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- MXNet Implementation of DeepMind's Neural Arithmetic Logic Units (NALU)☆18Aug 10, 2018Updated 7 years ago
- A safe and efficient autonomous driving algorithm. Winner of the 2019 DriveML Huawei Autonomous Vehicles Challenge. Built using RLLib and…☆18Jan 24, 2020Updated 6 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- A very basic LSTM composer, doesn't compose any proper music for now☆109Aug 27, 2018Updated 7 years ago
- Python API and analysis of Chicago's bikeshare☆10Dec 8, 2022Updated 3 years ago
- Annotated Minecraft dataset for machine learning☆13Nov 13, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Noisy network measurement with stan☆56Jan 16, 2026Updated 5 months ago
- ☆12Jan 7, 2019Updated 7 years ago
- Like word2vec, except for letters of the alphabet.☆17May 29, 2017Updated 9 years ago
- A Higher-order HMM with EM algo.☆16May 4, 2022Updated 4 years ago
- Documentation website for teleport generators.☆14Feb 3, 2023Updated 3 years ago
- Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays, CORL-2020.☆26Jun 3, 2021Updated 5 years ago
- GPU Programming with Python and CUDA.☆26Jun 23, 2026Updated last week