allenai / hyperdecodersLinks

Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304

☆12

Alternatives and similar repositories for hyperdecoders

Users that are interested in hyperdecoders are comparing it to the libraries listed below

Sorting:

swj0419 / in-context-pretraining
☆53Updated last year
nayeon7lee / FactualityPrompt
☆87Updated 2 years ago
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22Updated 11 months ago
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Updated 2 years ago
google-research-datasets / GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…
☆60Updated 2 years ago
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆33Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆62Updated last year
Leezekun / dialogic
[EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"
☆35Updated 2 years ago
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆16Updated last year
llyx97 / TAMT
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15Updated 2 years ago
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
McGill-NLP / retriever-lm-reasoning
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Updated last year
swj0419 / kNN_prompt
TBC
☆27Updated 2 years ago
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆73Updated 8 months ago
dqxiu / KAssess
☆14Updated last year
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆108Updated 2 years ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53Updated 3 years ago
PrasannS / rlhf-length-biases
☆27Updated last year
YuxiXie / SelfEval-Guided-Decoding
☆100Updated last year
lancopku / MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19Updated 2 years ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Updated 2 years ago
violet-zct / swarm-distillation-zero-shot
☆22Updated 2 years ago
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆29Updated last year
OhadRubin / EPR
☆63Updated 2 years ago
FranxYao / FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.
☆131Updated 2 years ago
sail-sg / symbolic-instruction-tuning
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Updated 2 years ago
HKUNLP / icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
☆102Updated 2 years ago
SimengSun / ChapterBreak
☆11Updated last year
shankarp8 / knowledge_distillation
Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).
☆26Updated 11 months ago
McGill-NLP / polytropon
☆54Updated 2 years ago