sarthmit / Compositional-Attention

Code to reproduce the results for Compositional Attention

☆59

Alternatives and similar repositories for Compositional-Attention

Users who are interested in Compositional-Attention are comparing it to the repositories listed below.

RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67 Updated 2 years ago
jiamings / ml-cpc
☆36 Updated 4 years ago
yilundu / improved_contrastive_divergence
[ICML'21] Improved Contrastive Divergence Training of Energy Based Models
☆66 Updated 3 years ago
bergen / EdgeTransformer
☆22 Updated 3 years ago
ermongroup / subsets
Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.
☆57 Updated 2 years ago
google-research / head2toe
☆81 Updated last year
lucidrains / memformer
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆123 Updated 4 years ago
ischlag / Fast-Weight-Memory-public
Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.
☆28 Updated 4 years ago
RobertCsordas / ndr
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆33 Updated 4 months ago
Noahs-ARK / RFA
☆33 Updated 4 years ago
ssnl / PyTorch-Reparam-Module
Reparameterize your PyTorch modules
☆71 Updated 4 years ago
ec6dde01667145e58de60f864e05a4 / CausalOptimizationAnon
☆65 Updated last year
yilundu / ebm_compositionality
[NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models
☆46 Updated 2 years ago
ischlag / fast-weight-transformers
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆105 Updated 4 years ago
facebookresearch / directclr
Code used in "Understanding Dimensional Collapse in Contrastive Self-supervised Learning" paper.
☆79 Updated 3 years ago
haoliuhl / hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
☆75 Updated 2 years ago
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101 Updated 2 years ago
RobertCsordas / modules
The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…
☆46 Updated 2 years ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆49 Updated 4 years ago
YannDubs / Invariant-Self-Supervised-Learning
Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"
☆41 Updated 2 years ago
hlml / fortuitous_forgetting
☆19 Updated 3 years ago
varunnair18 / FISH
Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).
☆59 Updated 3 years ago
s-nlp / certain-transformer
How certain is your transformer?
☆25 Updated 4 years ago
google-deepmind / emergent_in_context_learning
☆85 Updated last year
google-deepmind / ssl_hsic
☆38 Updated last year
joshr17 / IFM
Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021.
☆47 Updated 3 years ago
sajadn / Exemplar-VAE
Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation
☆69 Updated 4 years ago
ferrine / hyrnn
Hyperbolic Neural Networks, pytorch
☆87 Updated 6 years ago
gibipara92 / learning-explanations-hard-to-vary
Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …
☆41 Updated 4 years ago
cpcp1998 / PermuteFormer
Code for the paper PermuteFormer
☆42 Updated 4 years ago