Implementation of Scheduled Policy Optimization for task-oriented language grounding
☆29Jul 16, 2018Updated 7 years ago
Alternatives and similar repositories for walk_the_blocks
Users that are interested in walk_the_blocks are comparing it to the libraries listed below
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- Tree-LSTM + Self-Structured Attention -- a method to summarize textual data by topics☆10Apr 26, 2018Updated 7 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- Professor Forcing, NIPS'16☆45Apr 3, 2017Updated 8 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Code for Emergent Translation in Multi-Agent Communication☆81Jun 6, 2018Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- Repository containing supplementary materials and code for "JuMP: A Modeling Language for Mathematical Optimization"☆27Mar 1, 2016Updated 10 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 8 years ago
- ☆29Jan 25, 2018Updated 8 years ago
- in progress☆61Feb 19, 2016Updated 10 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 8 years ago
- Reinforcement learning models in ViZDoom environment☆130Mar 9, 2022Updated 4 years ago
- ☆32Apr 22, 2019Updated 6 years ago
- A latent variable RNN model for discourse-driven language modeling☆36Jun 22, 2016Updated 9 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Mar 28, 2018Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- ☆11Jan 28, 2019Updated 7 years ago
- ☆35Apr 10, 2018Updated 7 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- ☆29Mar 24, 2018Updated 7 years ago
- ☆44Dec 4, 2018Updated 7 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- a port of the Wavenet algorithm to generate poems (using Samuel Graván's @Zeta36 code).☆36May 30, 2017Updated 8 years ago
- Generating Text through Adversarial Training(GAN) using Skip-Thought Vectors☆45Oct 30, 2021Updated 4 years ago
- Attention is All You Need in Sonnet☆38Aug 30, 2017Updated 8 years ago
- Real valued neural networks (RVNN) and complex valued neural networks (CVNN) (Akira Hirose, 2012).☆11Jul 17, 2017Updated 8 years ago
- E-commerce shopping-guide chatbot: natural language understanding (NLU), text error correction, ambiguous-word disambiguation☆12May 5, 2020Updated 5 years ago
- ☆10Jul 9, 2020Updated 5 years ago
- A batch (multiple concurrent sequence pairs) implementation of Dynamic Time Warping (DTW) in Theano☆10Sep 13, 2015Updated 10 years ago
- Must-reads for new students: a collection of classic literature for newcomers (students of Prof. Gao)☆10Mar 5, 2018Updated 8 years ago