Implementation of q-learning using TensorFlow
☆58May 9, 2017Updated 8 years ago
Alternatives and similar repositories for dqn
Users that are interested in dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notes on Logistic Regression and OWLQN☆26Apr 8, 2017Updated 8 years ago
- We implement MADDPG in a congestion env, and compare with several control groups to highlight the performance of MADDPG☆11Jul 14, 2021Updated 4 years ago
- 通过对于现有开源分布式机器学习工具的整合(主要是基于参数服务器的logistic regression,xgboost,FFM,FM ),打造一个工业级的,可以线上使用的点击率预估流水线☆26Jun 6, 2017Updated 8 years ago
- Reimplementation of the clockwork recurrent neural network in Torch7☆14Feb 4, 2016Updated 10 years ago
- Paper published in the Journal of Investment Management, co-authored with Sanjiv R. Das☆13Oct 4, 2017Updated 8 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Feb 29, 2016Updated 10 years ago
- ☆16Feb 1, 2022Updated 4 years ago
- Improved Leach is a project in Omnet++ aiming to simulate a version of Leach with an improved Cluster Head (CH) selection scheme.☆13Sep 19, 2018Updated 7 years ago
- A routing algorithm based on QLearning☆17Dec 24, 2020Updated 5 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- A reinforcement learning algorithm for congestion control, together with a realistic Omnet++ network simulation environment☆36Jul 20, 2023Updated 2 years ago
- ESP32S3 AI voice assistant is a voice interaction system based on ESP32S3, implemented with Arduino IDE.☆12Aug 26, 2024Updated last year
- 浙江大学课程攻略共享计划☆12Jul 23, 2021Updated 4 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Sep 27, 2016Updated 9 years ago
- Model-Based Stochastic Search for Large Scale Optimization of Multi-Agent UAV Swarms☆18Jul 30, 2018Updated 7 years ago
- Theano-based Deep Learning library (convnets, recurrent neural networks, and more).☆12Nov 24, 2019Updated 6 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 6 years ago
- ☆13May 3, 2017Updated 8 years ago
- Example of a Variational-Autoencoder using Theano blocks☆12Jun 16, 2015Updated 10 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 9 years ago
- Tree-Invent: A novel molecular generative model constrained with topological tree☆13Jul 26, 2023Updated 2 years ago
- The source code for the paper "Anonymous Hedonic Game for Task Allocation in a Large-Scale Multiple Agent System" in T-RO (10.1109/TRO.20…☆24May 24, 2024Updated last year
- Urban UAV Mobility Model for NS3☆15Aug 2, 2022Updated 3 years ago
- coding examples to Intro to RL☆13Apr 30, 2018Updated 7 years ago
- Deep structured semantic model☆32May 5, 2016Updated 9 years ago
- Keras Implementation of the continuous control with actor-critic, a3c☆13Dec 3, 2017Updated 8 years ago
- tools for alpha research☆23Dec 20, 2017Updated 8 years ago
- [JSS'19] A Blockchain-based Lightweight Framework for Edge and Fog Computing☆45Jun 18, 2021Updated 4 years ago
- Implementation of condnets☆16Apr 21, 2016Updated 9 years ago
- Long Short-Term Memory Recurrent Neural Networks☆26Jun 11, 2015Updated 10 years ago
- record and share my reading everyday☆12Apr 1, 2016Updated 9 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 6 years ago
- Next word prediction based on N-gram language model☆12Jan 11, 2015Updated 11 years ago
- Ubuntu 一键装机☆11May 4, 2017Updated 8 years ago
- Created during a nodecopter hackday in Brighton. The example script attempts to position an ar-drone so that it centers on any face detec…☆41Oct 9, 2017Updated 8 years ago
- Simulation of LEACH routing protocol on OMNET++☆23Apr 16, 2023Updated 2 years ago
- Variational Bayes for NN in Torch7 (http://papers.nips.cc/paper/4329-practical-variational-inference-for-neural-networks.pdf)☆10Mar 23, 2015Updated 11 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆158Oct 19, 2017Updated 8 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆13Dec 30, 2016Updated 9 years ago