MurtyShikhar / Pushdown-Layers

Code for Pushdown Layers from our EMNLP 2023 paper

☆28

Alternatives and similar repositories for Pushdown-Layers

Users who are interested in Pushdown-Layers are comparing it to the repositories listed below.

sustcsonglin / TN-PCFG
source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols" and ACL2021 main conferenc…
☆50 Updated 3 months ago
sustcsonglin / gated_linear_attention_layer
☆32 Updated last year
MurtyShikhar / structural-grokking
Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"
☆21 Updated last year
zomux / lanmt-ebm
lanmt ebm
☆12 Updated 5 years ago
machelreid / editpro
Learning to Model Editing Processes
☆26 Updated 3 years ago
jungokasai / deep-shallow
☆44 Updated 4 years ago
bergen / EdgeTransformer
☆22 Updated 3 years ago
belindal / state-probes
Code for the paper "Implicit Representations of Meaning in Neural Language Models"
☆54 Updated 2 years ago
suzgunmirac / crowd-sampling
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
☆18 Updated 2 years ago
RobertCsordas / ndr
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆33 Updated 2 weeks ago
xuanlinli17 / autoregressive_inference
Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)
☆12 Updated last year
ekinakyurek / lexical
Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling
☆16 Updated 3 years ago
yikangshen / megablocks
☆20 Updated last year
rycolab / parsing-as-tagging
☆18 Updated last year
iesl / s-diora
☆12 Updated 4 years ago
zhaoyd1 / Dep_Transformer_Grammars
☆14 Updated 8 months ago
whyNLP / Probabilistic-Transformer
A probabilistic model for contextual word representation. Accepted to ACL2023 Findings.
☆23 Updated last year
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58 Updated 2 years ago
sustcsonglin / mamba-triton
☆48 Updated last year
yoonkim / neural-qcfg
☆45 Updated 3 years ago
cpllab / syntactic-generalization
Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"
☆28 Updated 4 years ago
microsoft / EfficientLongSequenceModeling
☆51 Updated 2 years ago
Shark-NLP / CAB
☆31 Updated last year
rycolab / prefix-parsing
☆14 Updated last year
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆19 Updated 2 years ago
tommccoy1 / rnn-hierarchical-biases
Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"
☆23 Updated 5 years ago
abhishekpanigrahi1996 / transformer_in_transformer
☆45 Updated last year
rycolab / tree_expectations
Code Repository for "Efficient Computation of Expectations under Spanning Tree Distributions", http://arxiv.org/abs/2008.12988
☆10 Updated 4 years ago
viking-sudo-rm / rusty-dawg
Rust library for indexing and quickly searching large pretraining corpora
☆26 Updated this week
nyu-mll / SQuALITY
Query-focused summarization data
☆42 Updated 2 years ago