Implementation of TD Lambda algorithm, with neural network for value estimation
☆20Apr 16, 2018Updated 7 years ago
Alternatives and similar repositories for Deep-Watkins-Q-and-Actor-Critic
Users that are interested in Deep-Watkins-Q-and-Actor-Critic are comparing it to the libraries listed below.
- Markov Chain Monte Carlo (MCMC) and importance sampling in the context of Bayesian linear regression☆11Feb 25, 2018Updated 8 years ago
- Adaptive stress testing of black-box systems within POMDPs.jl☆16Feb 6, 2024Updated 2 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆14Dec 11, 2021Updated 4 years ago
- ☆11Jul 25, 2021Updated 4 years ago
- ☆11Feb 13, 2021Updated 5 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- A detailed code walkthrough of the paper Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee☆11Dec 27, 2023Updated 2 years ago
- Dreamer on JAX☆16Jan 19, 2022Updated 4 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Playground for motion planning and controls algorithms.☆15Aug 15, 2018Updated 7 years ago
- ☆18Dec 11, 2015Updated 10 years ago
- A classifier for cat and dog images (Response to Siraj's challenge of the week)☆38Feb 25, 2017Updated 9 years ago
- ☆12Jul 21, 2021Updated 4 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- A Bachelor Thesis implementation of RRT, RRT* and Informed RRT*.☆14Jul 9, 2018Updated 7 years ago
- NLP Resources for Indian Languages☆10Nov 9, 2020Updated 5 years ago
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Code from the paper An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent …☆14Mar 20, 2024Updated 2 years ago
- LSTM based neural network that predicts the state of the vehicle in terms of position and velocity.☆14May 7, 2021Updated 4 years ago
- Simulation of RRT* algorithms with and without Dubins Nonholonomic Robot steering.☆68Nov 15, 2017Updated 8 years ago
- A simulation, planning and control toolbox for planar manipulation (e.g., pushing and grasping).☆25Jul 8, 2017Updated 8 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Hardware design files for BLDC servo controller☆13Sep 24, 2022Updated 3 years ago
- Python API and analysis of Chicago's bikeshare☆10Dec 8, 2022Updated 3 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- A tool for crawling literature from the WOS (Web of Science) website☆19Sep 7, 2018Updated 7 years ago
- $GIT_REV in your dokku env☆15Jun 28, 2018Updated 7 years ago
- This is the code for "DeepMind Reinforcement Learning" By Siraj Raval on Youtube☆85Sep 5, 2018Updated 7 years ago
- Like word2vec, except for letters of the alphabet.☆17May 29, 2017Updated 8 years ago
- A genetic algorithm that learns to play the game Qwixx☆14Mar 1, 2024Updated 2 years ago
- Replicated the Alpha Go Zero paper but applied it to the game Santorini.☆13Jan 27, 2018Updated 8 years ago
- Graph Convolutional Networks in JAX☆33Jan 3, 2021Updated 5 years ago
- Simple AI Agent Trained to play Hangman☆13Sep 27, 2019Updated 6 years ago
- ☆20Mar 28, 2023Updated 3 years ago
- Implementation of RRT, RRT-connect, RRT*, and PRM in c++☆13Oct 25, 2017Updated 8 years ago
- Indeed web crawler☆11Aug 14, 2018Updated 7 years ago
- A Telegram bot for the BoardGameGeek☆13Nov 30, 2025Updated 4 months ago
- Reinforcement Learning via Latent State Decoding☆29Jun 12, 2023Updated 2 years ago