srush / ProbTalk
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ProbTalk
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆11Updated 8 months ago
- Silly twitter torch implementations.☆46Updated 2 years ago
- Code Repository for "Efficient Computation of Expectations under Spanning Tree Distributions", http://arxiv.org/abs/2008.12988☆10Updated 3 years ago
- Learning to Model Editing Processes☆26Updated 2 years ago
- ☆38Updated 4 years ago
- ☆42Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Updated 2 years ago
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆18Updated 8 months ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆11Updated 2 years ago
- ☆12Updated 3 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks"☆22Updated 4 years ago
- ☆12Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆56Updated 5 months ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Updated 2 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Updated 2 years ago
- Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference☆15Updated 4 years ago
- ☆22Updated 3 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆32Updated 3 weeks ago
- Query-focused summarization data☆41Updated last year
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11Updated 3 years ago
- lanmt ebm☆11Updated 4 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- ☆25Updated 2 years ago
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language☆15Updated 4 years ago
- All materials that accompany/are needed to reproduce ACL 2020 paper - Interpreting Pretrained Contextualized Representations via Reductio…☆18Updated 4 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated this week
- Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization☆14Updated 2 years ago