Implementation of q-learning using TensorFlow
☆58May 9, 2017Updated 9 years ago
Alternatives and similar repositories for dqn
Users that are interested in dqn are comparing it to the libraries listed below.
- Notes on Logistic Regression and OWLQN☆26Apr 8, 2017Updated 9 years ago
- We implement MADDPG in a congestion env, and compare with several control groups to highlight the performance of MADDPG☆11Jul 14, 2021Updated 4 years ago
- collections of language style transfer papers☆10Jan 4, 2018Updated 8 years ago
- Biomedical Relation Extraction for Transcription Factor and Gene / Gene Products (part of a Master Thesis at Rostlab, TUM)☆12Dec 23, 2017Updated 8 years ago
- Reimplementation of the clockwork recurrent neural network in Torch7☆14Feb 4, 2016Updated 10 years ago
- Please read README file.☆12Mar 17, 2023Updated 3 years ago
- Paper published in the Journal of Investment Management, co-authored with Sanjiv R. Das☆13Oct 4, 2017Updated 8 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Feb 29, 2016Updated 10 years ago
- ☆16Feb 1, 2022Updated 4 years ago
- Breezedeus's Blog☆17Jul 4, 2023Updated 2 years ago
- LSTM Python library using Cython☆39Dec 19, 2014Updated 11 years ago
- A reinforcement learning algorithm for congestion control, together with a realistic Omnet++ network simulation environment☆36Jul 20, 2023Updated 2 years ago
- An attempt at implementing ideas in "Learning to Transduce with Unbounded Memory" (http://arxiv.org/abs/1506.02516)☆11Jul 27, 2016Updated 9 years ago
- LoRa AODV Routing Protocol implementation modifying FLoRa framework. It works on Omnet++☆12Mar 26, 2020Updated 6 years ago
- ☆15Dec 6, 2017Updated 8 years ago
- Model-Based Stochastic Search for Large Scale Optimization of Multi-Agent UAV Swarms☆18Jul 30, 2018Updated 7 years ago
- Modified AODV Based on Clustering and Gateway for Wireless Sensor Network☆17Mar 8, 2019Updated 7 years ago
- Using reinforcement learning to make markets in the high frequency trading setting.☆29Apr 8, 2025Updated last year
- Example of a Variational-Autoencoder using Theano blocks☆11Jun 16, 2015Updated 10 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- Tree-Invent: A novel molecular generative model constrained with topological tree☆14Jul 26, 2023Updated 2 years ago
- Urban UAV Mobility Model for NS3☆15Aug 2, 2022Updated 3 years ago
- Hierarchical Encoder Decoder for Dialog Modelling☆16May 20, 2015Updated 11 years ago
- coding examples to Intro to RL☆13Apr 30, 2018Updated 8 years ago
- Find the IP/domain name with the lowest ping☆14Feb 28, 2013Updated 13 years ago
- This repository implements Distilled Graph Attention Policy Networks (DGAPNs), a curiosity-driven reinforcement learning model to generat…☆21Jan 21, 2022Updated 4 years ago
- Improved leach protocol for routing in WSN☆19May 20, 2020Updated 6 years ago
- Routing Protocol for Low power and Lossy Networks (RPL) simulation model☆19Oct 2, 2023Updated 2 years ago
- Long Short-Term Memory Recurrent Neural Networks☆26Jun 11, 2015Updated 10 years ago
- record and share my reading everyday☆12Apr 1, 2016Updated 10 years ago
- ☆10Jul 21, 2017Updated 8 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- One-click Ubuntu setup☆11May 4, 2017Updated 9 years ago
- Simulation of LEACH routing protocol on OMNET++☆23Apr 16, 2023Updated 3 years ago
- generative models for speech☆20Jul 4, 2016Updated 9 years ago
- Backprop training of recurrent neural networks with Hebbian plastic connections☆20Jun 30, 2021Updated 4 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆155Oct 19, 2017Updated 8 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Mar 4, 2016Updated 10 years ago
- OMNeT++ project about routing protocols under wireless mesh networks. Presents main brand new ACO based routing protocol AntWMNet, and A…☆23Jun 4, 2015Updated 10 years ago