kernelmachine / demixLinks

DEMix Layers for Modular Language Modeling

☆53

Alternatives and similar repositories for demix

Users that are interested in demix are comparing it to the libraries listed below

Sorting:

McGill-NLP / polytropon
☆54Updated 2 years ago
GEM-benchmark / GEM-metrics
Automatic metrics for GEM tasks
☆66Updated 2 years ago
kernelmachine / demix-data
Benchmark API for Multidomain Language Modeling
☆25Updated 2 years ago
INK-USC / CrossFit
Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)
☆112Updated 3 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Updated 2 years ago
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆139Updated 3 years ago
violet-zct / swarm-distillation-zero-shot
☆22Updated 2 years ago
tanyuqian / ctc-gen-eval
EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation
☆97Updated 2 years ago
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70Updated last year
suzgunmirac / crowd-sampling
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
☆18Updated 2 years ago
yanaiela / pararel
☆45Updated last year
peterwestuw / surface-form-competition
☆58Updated 3 years ago
GXimingLu / Quark
☆75Updated last year
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85Updated 3 years ago
archiki / GrIPS
Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"
☆55Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
jxhe / efficient-knnlm
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆73Updated 3 years ago
jzbjyb / lm-calibration
☆35Updated 3 years ago
martiansideofthemoon / longeval-summarization
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆44Updated 11 months ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58Updated 2 years ago
swj0419 / kNN_prompt
TBC
☆27Updated 2 years ago
jacobandreas / geca
☆42Updated 4 years ago
machelreid / m2d2
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Updated 2 years ago
qcwthu / Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
☆55Updated 3 years ago
princeton-nlp / DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
☆114Updated 2 years ago
AkariAsai / evidentiality_qa
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44Updated 2 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆30Updated 3 years ago
alon-albalak / TLiDB
Transfer Learning in Dialogue Benchmarking Toolkit
☆14Updated 2 years ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆30Updated 3 years ago
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago