lufficc/dqn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lufficc/dqn)

lufficc / dqn

Implementation of q-learning using TensorFlow

☆58

Alternatives and similar repositories for dqn

Users that are interested in dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aggresss / GPUDemo
View on GitHub
《显卡就是开发板》所提到的文档，代码和程序
☆19Apr 10, 2018Updated 8 years ago
ChenChengKuan / awesome_deep_language_style_transfer
View on GitHub
collections of language style transfer papers
☆10Jan 4, 2018Updated 8 years ago
Rostlab / relna
View on GitHub
Biomedical Relation Extraction for Transcription Factor and Gene / Gene Products (part of a Master Thesis at Rostlab, TUM)
☆12Dec 23, 2017Updated 8 years ago
zergylord / ClockworkRNN
View on GitHub
Reimplementation of the clockwork recurrent neural network in Torch7
☆14Feb 4, 2016Updated 10 years ago
waylensu / DeepFFM
View on GitHub
☆27Jun 26, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
davidBelanger / tf-hypergrad
View on GitHub
simple example of gradient-based hyperparameter optimization using tensorflow
☆19Feb 29, 2016Updated 10 years ago
yangkevin2 / neurips2021-lap3
View on GitHub
☆16Feb 1, 2022Updated 4 years ago
rjagerman / glintlda
View on GitHub
Scalable Distributed LDA implementation for Spark & Glint
☆29Sep 27, 2016Updated 9 years ago
ddfan / swarm_evolve
View on GitHub
Model-Based Stochastic Search for Large Scale Optimization of Multi-Agent UAV Swarms
☆18Jul 30, 2018Updated 7 years ago
ogrisel / keras
View on GitHub
Theano-based Deep Learning library (convnets, recurrent neural networks, and more).
☆12Nov 24, 2019Updated 6 years ago
hongyanz / Stackelberg-GAN
View on GitHub
Codes for Stackelberg GAN
☆15Apr 23, 2019Updated 7 years ago
udibr / VAE
View on GitHub
Example of a Variational-Autoencoder using Theano blocks
☆11Jun 16, 2015Updated 11 years ago
lilac / fun
View on GitHub
The fun programming language
☆17Oct 30, 2024Updated last year
sujiongming / starcraftAI
View on GitHub
多智能体即时策略对抗方法与实践苏炯铭刘鸿福陈少飞项凤涛编著科学出版社 2019.11 随书代码
☆31Nov 17, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wulfebw / async_rl
View on GitHub
Python implementation of tabular asynchronous actor critic
☆11May 3, 2016Updated 10 years ago
MingyuanXu / Tree-Invent
View on GitHub
Tree-Invent: A novel molecular generative model constrained with topological tree
☆14Jul 26, 2023Updated 2 years ago
inmo-jang / GRAPE
View on GitHub
The source code for the paper "Anonymous Hedonic Game for Task Allocation in a Large-Scale Multiple Agent System" in T-RO (10.1109/TRO.20…
☆24May 24, 2024Updated 2 years ago
sordonia / hed-dlg
View on GitHub
Hierarchical Encoder Decoder for Dialog Modelling
☆16May 20, 2015Updated 11 years ago
outstandingcandy / dssm
View on GitHub
Deep structured semantic model
☆32May 5, 2016Updated 10 years ago
mozillazg / justping
View on GitHub
找出 ping 值最小的 IP/域名
☆14Feb 28, 2013Updated 13 years ago
njchoma / DGAPN
View on GitHub
This repository implements Distilled Graph Attention Policy Networks (DGAPNs), a curiosity-driven reinforcement learning model to generat…
☆21Jan 21, 2022Updated 4 years ago
Cloudslab / FogBus
View on GitHub
[JSS'19] A Blockchain-based Lightweight Framework for Edge and Fog Computing
☆45Jun 18, 2021Updated 5 years ago
bengioe / condnet
View on GitHub
Implementation of condnets
☆16Apr 21, 2016Updated 10 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
xingdi-eric-yuan / recurrent-net-lstm
View on GitHub
Long Short-Term Memory Recurrent Neural Networks
☆26Jun 11, 2015Updated 11 years ago
JunhongXu / planet-kaggle-pytorch
View on GitHub
☆10Jul 21, 2017Updated 8 years ago
ManikandanThangavelu / scikitcrf_NER
View on GitHub
Python library for custom entity recognition using Sklearn CRF
☆17Aug 1, 2017Updated 8 years ago
GradientTrader / simulator
View on GitHub
☆51Dec 13, 2017Updated 8 years ago
17881055 / vue-sudoku
View on GitHub
vue sudoku （数独）
☆11Jan 5, 2023Updated 3 years ago
igul222 / speech
View on GitHub
generative models for speech
☆20Jul 4, 2016Updated 10 years ago
louissmit / VBNN
View on GitHub
Variational Bayes for NN in Torch7 (http://papers.nips.cc/paper/4329-practical-variational-inference-for-neural-networks.pdf)
☆10Mar 23, 2015Updated 11 years ago
Zeta36 / Asynchronous-Methods-for-Deep-Reinforcement-Learning
View on GitHub
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …
☆84Mar 4, 2016Updated 10 years ago
li-xin-yi / Project-Wuhu
View on GitHub
芜湖计划-课业相关的闲杂物品（主要为tex文档）存放处
☆12Feb 7, 2016Updated 10 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
morgangiraud / openai-rl
View on GitHub
Benchmark of different RL algorithm
☆13Dec 8, 2022Updated 3 years ago
chenhuang-learn / large-scale-lbfgs
View on GitHub
a large scale lbfgs using a method in nips 2014 paper "Large-scale L-BFGS using MapReduce".
☆13May 30, 2015Updated 11 years ago
UNSWComputing / rUNSWift-2016-release
View on GitHub
UNSW's RoboCup Standard Platform League Team
☆12Jun 18, 2022Updated 4 years ago
trevorcampbell / sda-bnp
View on GitHub
Streaming, Distributed, Asynchronous Bayesian Nonparametric Inference
☆12Nov 2, 2015Updated 10 years ago
mpezeshki / RNN_Experiments
View on GitHub
General experiments on Vanilla RNN and LSTM in Theano.
☆16Aug 23, 2015Updated 10 years ago
0bserver07 / Study-Reinforcement-Learning
View on GitHub
RL study guide — foundations through RLHF, DPO, GRPO, RLVR, agentic RL, and offline RL. Hand-written CS294 notes, 19 lecture drafts, 5 te…
☆159May 15, 2026Updated last month
menpo / conda-opencv
View on GitHub
Conda build scripts for OpenCV 2.x
☆10Jun 16, 2016Updated 10 years ago