Some code for tutorials following https://gym.openai.com/docs/rl
☆14Jul 3, 2016Updated 9 years ago
Alternatives and similar repositories for deep-rl-gym-tutorials
Users that are interested in deep-rl-gym-tutorials are comparing it to the libraries listed below
Sorting:
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 10 years ago
- My Udacity Machine Learning Nanodegree capstone project in Reinforcement Learning☆10Dec 1, 2017Updated 8 years ago
- Implementation of QA Networks☆10Jul 14, 2016Updated 9 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Behavioral Cloning project that teaches a car to drive autonomously using Deep Learning with Keras☆12Dec 18, 2016Updated 9 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Sep 27, 2016Updated 9 years ago
- Python package to sample from determinantal point processes☆18Jul 20, 2015Updated 10 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- Exploratory topic modeling with distributional semantics and interactive visualization☆18Jan 11, 2017Updated 9 years ago
- An evolutionary algorithm-based optimization for tracking weights in the OpenSim Residual Reduction Algorithm (RRA).☆11Jul 17, 2023Updated 2 years ago
- datasets for NLP research☆24Nov 6, 2021Updated 4 years ago
- Determinantal point process☆18Sep 16, 2015Updated 10 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- Mujoco Models for the Fetch Robot☆32Feb 9, 2025Updated last year
- Kaggle "Facial Keypoints Detection" competition.☆25Jan 9, 2017Updated 9 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Recurrent Convolutional Memory Network (in progress)☆29Apr 16, 2016Updated 9 years ago
- Provides Movie Recommendations on the MovieLens ml-100k dataset using Collaborative Filtering☆11Nov 14, 2013Updated 12 years ago
- workspace comprising demo packages for our roscon2018 talk☆10Dec 21, 2019Updated 6 years ago
- Collaborative Deep Reinforcement Learning☆32Jul 29, 2017Updated 8 years ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- Python server for NAO Communication project☆11Aug 22, 2018Updated 7 years ago
- C++ library to work with Iso8583 messages☆10Sep 22, 2018Updated 7 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- This is the implementation of paper Model Free Episodic Control☆36Sep 30, 2019Updated 6 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Feb 16, 2018Updated 8 years ago
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Feb 25, 2017Updated 9 years ago
- Work towards creating a common JSON based format for compact network specification☆14Jan 6, 2026Updated last month
- Code for ICML 2022 paper: Achieving Fairness at No Utility Cost via Data Reweighing with Influence☆11Aug 3, 2022Updated 3 years ago
- ☆11Apr 14, 2022Updated 3 years ago
- Code to reproduce all the results in the paper: "Learning dynamics of linear denoising autoencoders." (ICML 2018)☆11Aug 20, 2018Updated 7 years ago
- Simple Flask webservice to search through your PDF collection using Whoosh☆11Jul 11, 2014Updated 11 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- ResearchDoom fork of the Chocolate Doom engine.☆16Oct 20, 2017Updated 8 years ago