xwhan / walk_the_blocksView external linksLinks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Jul 16, 2018Updated 7 years ago
Alternatives and similar repositories for walk_the_blocks
Users that are interested in walk_the_blocks are comparing it to the libraries listed below
Sorting:
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- Tree-LSTM + Self-Structured Attention -- a method to summarize textual data by topics☆10Apr 26, 2018Updated 7 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 6 years ago
- Professor Forcing, NIPS'16☆45Apr 3, 2017Updated 8 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Code for Emergent Translation in Multi-Agent Communication☆81Jun 6, 2018Updated 7 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 8 years ago
- Repository containing supplementary materials and code for "JuMP: A Modeling Language for Mathematical Optimization"☆27Mar 1, 2016Updated 9 years ago
- in progress☆61Feb 19, 2016Updated 9 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 8 years ago
- Weakly Supervised Topic Segmentation and Labeling☆33Jan 16, 2022Updated 4 years ago
- Cornell Instruction Following Framework☆34Oct 11, 2021Updated 4 years ago
- A latent variable RNN model for discourse-driven language modeling☆36Jun 22, 2016Updated 9 years ago
- ☆35Apr 10, 2018Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- ☆11Jan 28, 2019Updated 7 years ago
- ☆29Mar 24, 2018Updated 7 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- Dynamic Robot Instruction Following☆39Dec 28, 2021Updated 4 years ago
- ☆44Dec 4, 2018Updated 7 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- a port of the Wavenet algorithm to generate poems (using Samuel Graván's @Zeta36 code).☆36May 30, 2017Updated 8 years ago
- 基于capsule的观点型阅读理解模型☆88Aug 8, 2019Updated 6 years ago
- Generating Text through Adversarial Training(GAN) using Skip-Thought Vectors☆45Oct 30, 2021Updated 4 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- ☆10Jul 9, 2020Updated 5 years ago
- A batch (multiple concurrent sequence pairs) implementation of Dynamic Time Warping (DTW) in Theano☆10Sep 13, 2015Updated 10 years ago
- a feature frontend for VINS☆10Aug 27, 2018Updated 7 years ago
- Real valued neural networks (RVNN) and complex valued neural networks (CVNN) (Akira Hirose, 2012).☆11Jul 17, 2017Updated 8 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Dec 18, 2017Updated 8 years ago