facebookresearch / Permutation-Equivariant-Seq2SeqLinks

Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for natural language modeling fail when such compositional generalization is required. The main contribution of this paper is to hypothesize that language compositionality is a form of group-equivariance. Based on thi…

☆27

Alternatives and similar repositories for Permutation-Equivariant-Seq2Seq

Users that are interested in Permutation-Equivariant-Seq2Seq are comparing it to the libraries listed below

Sorting:

wouterkool / estimating-gradients-without-replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
☆40Updated 5 years ago
facebookresearch / meta_seq2seq
Compositional generalization through meta sequence-to-sequence learning
☆83Updated 5 years ago
jacobandreas / l3
Learning with latent language
☆51Updated 4 years ago
deep-spin / lp-sparsemap
LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
☆41Updated last year
jacobandreas / tre
Measuring compositionality in representation learning
☆73Updated 6 years ago
harvardnlp / hmm-lm
☆41Updated 4 years ago
Holmeswww / PPOGAN
☆25Updated last year
brendenlake / meta_seq2seq
PyTorch code for meta seq2seq learning
☆43Updated 5 years ago
rizar / CLOSURE
Systematic generalization test for CLEVR
☆15Updated 5 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25Updated 4 years ago
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
frankaging / Reason-SCAN
ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…
☆20Updated 3 years ago
paruby / DIP-VAE
An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…
☆26Updated 7 years ago
kingofspace0wzz / wae-rnf-lm
Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…
☆62Updated 5 years ago
OliverRichter / normalized-attention
Code publication to the paper "Normalized Attention Without Probability Cage"
☆16Updated 3 years ago
ranjaykrishna / easyturk
Make quick mechanical turk HTML/Javascript interfaces and launch them with Python functions
☆41Updated 4 years ago
choidami / sst
☆50Updated 4 years ago
asmadotgh / neural_chat_web
The server portion of the Neural Chat project to deploy chatbots on web. This code is accompanied by another repository that includes the…
☆36Updated 4 years ago
zomux / lanmt-ebm
lanmt ebm
☆12Updated 5 years ago
carolinlawrence / gradient-rollback
Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…
☆21Updated 4 years ago
rowanz / piglet
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World [ACL 2021]
☆56Updated 3 years ago
rizar / systematic-generalization-sqoop
Code for "Systematic Generalization: What Is Required and Can It Be Learned"
☆37Updated 6 years ago
ischlag / TPR-RNN
Code for the publication Learning to Reason with Third-Order Tensor Products.
☆41Updated 6 years ago
salesforce / esprit
Dataset and documentation for paper on explaining solutions to physical reasoning tasks (ESPRIT))
☆21Updated 3 months ago
ec6dde01667145e58de60f864e05a4 / CausalOptimizationAnon
☆65Updated last year
dhh1995 / SCL
PyTorch implementation for The Scattering Compositional Learner (SCL)
☆32Updated 5 years ago
thegregyang / NNspectra
Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube
☆47Updated 5 years ago
giannisdaras / smyrf
[NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".
☆50Updated last year
ischlag / Fast-Weight-Memory-public
Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.
☆28Updated 4 years ago
belindal / TaskBench500
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Updated 3 years ago