stevenxcao / subnetwork-probing
☆13 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for subnetwork-probing
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆26 · Updated last year
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization". ☆32 · Updated 3 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability ☆21 · Updated 2 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables ☆23 · Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆51 · Updated last year
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language ☆15 · Updated 4 years ago
- Code for "Does syntax need to grow on trees? Sources of inductive bias in sequence to sequence networks" ☆22 · Updated 4 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat… ☆27 · Updated 4 years ago
- Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization ☆14 · Updated 2 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆11 · Updated 8 months ago
- Differentiable Perturb-and-Parse operator ☆25 · Updated 5 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?" ☆20 · Updated 4 years ago
- Learning to Model Editing Processes ☆26 · Updated 2 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling ☆15 · Updated 2 years ago
- Variable-order CRFs with structure learning ☆16 · Updated 3 months ago
- Code for Residual Energy-Based Models for Text Generation in PyTorch. ☆23 · Updated 3 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp… ☆21 · Updated 3 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022) ☆14 · Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆31 · Updated 3 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020 ☆18 · Updated 3 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆69 · Updated last year
- Evaluating Machines by their Real-World Language Use ☆33 · Updated last year
- Code Repository for "Efficient Computation of Expectations under Spanning Tree Distributions", http://arxiv.org/abs/2008.12988 ☆10 · Updated 3 years ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind" ☆18 · Updated 2 years ago