srush / ProbTalk
☆29Updated 2 years ago
Related projects: ⓘ
- Silly twitter torch implementations.☆46Updated last year
- Code Repository for "Efficient Computation of Expectations under Spanning Tree Distributions", http://arxiv.org/abs/2008.12988☆10Updated 3 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 2 years ago
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Updated 2 years ago
- ☆12Updated 3 years ago
- Learning to Model Editing Processes☆26Updated 2 years ago
- ☆42Updated 3 years ago
- ☆37Updated 3 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"☆22Updated 4 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆27Updated last week
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆18Updated 5 months ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Updated 2 years ago
- ☆13Updated 3 years ago
- ☆21Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conferenc…☆44Updated 6 months ago
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language☆14Updated 3 years ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆48Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆51Updated 3 months ago
- ☆12Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆18Updated last year
- ☆34Updated 4 months ago
- lanmt ebm☆11Updated 4 years ago
- ☆44Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs☆41Updated 10 months ago