XiaoxiaoGuo / rcdqn
This repository contains the source code of the EMNLP 2020 paper Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning.
☆20Updated 4 years ago
Alternatives and similar repositories for rcdqn:
Users that are interested in rcdqn are comparing it to the libraries listed below
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 2 years ago
- Automatically Composing Representation Transformations as a Means for Generalization☆24Updated 5 years ago
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- Learning with latent language☆50Updated 3 years ago
- Goal driven language generation using knowledge graph A2C agents☆59Updated 4 years ago
- BabyAI++: Towards Grounded language Learning beyond Memorization, ICLR BeTR-RL 2020☆25Updated 4 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- ☆32Updated 3 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- ☆35Updated 6 months ago
- ☆20Updated 3 years ago
- The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper.☆16Updated 3 years ago
- Grounded SCAN data set.☆69Updated 3 years ago
- Template-DQN and DRRN agent implementations☆21Updated last year
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- Measuring compositionality in representation learning☆71Updated 5 years ago
- This repository contains the code used for Ordered Memory paper☆29Updated 5 years ago
- Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)☆34Updated 7 years ago
- Systematic generalization test for CLEVR☆15Updated 4 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆103Updated 2 years ago
- PyTorch implementation for The Scattering Compositional Learner (SCL)☆32Updated 4 years ago
- Compositional generalization through meta sequence-to-sequence learning☆84Updated 5 years ago
- ☆38Updated 3 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Variational Reinforcement Learning☆16Updated 5 months ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"☆44Updated 5 years ago