ischlag / Fast-Weight-Memory-public
Official code repository of the paper "Learning Associative Inference Using Fast Weight Memory" by Schlag et al.
☆26 Updated 3 years ago
Alternatives and similar repositories for Fast-Weight-Memory-public:
Users who are interested in Fast-Weight-Memory-public are comparing it to the repositories listed below.
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization". ☆32 Updated 3 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… ☆66 Updated 2 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat… ☆27 Updated 4 years ago
- Implementation of Relation Network and Recurrent Relational Network using PyTorch v1.3. Original papers: (RN) https://arxiv.org/abs/1706.… ☆19 Updated 2 years ago
- The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper. ☆16 Updated 3 years ago
- ☆45 Updated 3 years ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind" ☆18 Updated 2 years ago
- ☆35 Updated 6 months ago
- Code to reproduce the results for Compositional Attention ☆60 Updated 2 years ago
- ☆22 Updated 3 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention ☆47 Updated 4 years ago
- ☆22 Updated 3 years ago
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering" ☆44 Updated 5 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021) ☆48 Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch. ☆23 Updated 3 years ago
- ☆13 Updated 2 years ago
- ☆36 Updated 4 years ago
- ☆80 Updated 5 months ago
- lanmt ebm ☆11 Updated 4 years ago
- Code for the paper PermuteFormer ☆42 Updated 3 years ago
- ☆67 Updated 2 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS… ☆20 Updated 3 years ago
- ☆22 Updated 3 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability ☆21 Updated 2 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks ☆13 Updated last year
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind". ☆14 Updated last year
- Memory efficient MAML using gradient checkpointing ☆83 Updated 5 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆11 Updated 10 months ago
- Variational Transformers for Diverse Response Generation ☆80 Updated 5 months ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Updated 2 years ago