joeljang / ELMLinks

[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning

☆99

Alternatives and similar repositories for ELM

Users that are interested in ELM are comparing it to the libraries listed below

Sorting:

kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53Updated 3 years ago
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Updated 10 months ago
kernelmachine / silo-lm
SILO Language Models code repository
☆81Updated last year
kaistAI / InstructIR
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Updated last year
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 2 years ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
jzbjyb / ReAtt
Retrieval as Attention
☆83Updated 2 years ago
swj0419 / kNN_prompt
TBC
☆27Updated 2 years ago
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆139Updated 3 years ago
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆108Updated 2 years ago
kaistAI / GAP
[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization
☆29Updated 10 months ago
dmis-lab / TouR
Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval
☆30Updated last year
microsoft / KID
Knowledge Infused Decoding
☆71Updated last year
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Updated 2 years ago
google-deepmind / streamingqa
☆48Updated last year
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85Updated 3 years ago
oriram / spider
☆54Updated 2 years ago
facebookresearch / NPM
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
☆157Updated 2 years ago
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
archiki / GrIPS
Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"
☆55Updated 2 years ago
joeljang / temporalwiki
[EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
☆73Updated last year
allenai / natural-instructions-v1
Benchmarking Generalization to New Tasks from Natural Language Instructions
☆26Updated 4 years ago
MikeWangWZHL / Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆16Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
martiansideofthemoon / longeval-summarization
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆44Updated 11 months ago
INK-USC / CrossFit
Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)
☆112Updated 3 years ago
allenai / Lila
A unified benchmark for math reasoning
☆88Updated 2 years ago
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70Updated last year
AkariAsai / evidentiality_qa
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44Updated 2 years ago
thunlp / DPT
☆13Updated 3 years ago