stevenxcao / subnetwork-probing
☆13Updated 3 years ago
Alternatives and similar repositories for subnetwork-probing:
Users that are interested in subnetwork-probing are comparing it to the libraries listed below
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 3 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆32Updated 2 years ago
- ☆48Updated last year
- Variable-order CRFs with structure learning☆15Updated 6 months ago
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆23Updated 3 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 4 years ago
- Source Code for "Teaching Machine Comprehension with Compositional Explanations" (Findings of EMNLP 2020)☆11Updated 4 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Updated 11 months ago
- ☆38Updated 3 years ago
- ☆20Updated 3 years ago
- ☆22Updated 3 years ago
- ☆50Updated 3 years ago
- Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization☆14Updated 2 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Updated 2 years ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"☆18Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Updated last year
- ☆23Updated 5 months ago
- Pretraining summarization models using a corpus of nonsense☆13Updated 3 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Updated 3 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- ☆22Updated 2 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆20Updated 3 years ago
- ☆10Updated 2 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020☆18Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- ☆12Updated 3 years ago