renll / SparseLTLinks

[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing

☆14

Alternatives and similar repositories for SparseLT

Users that are interested in SparseLT are comparing it to the libraries listed below

Sorting:

jxhe / efficient-knnlm
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆74Updated 3 years ago
INK-USC / ReCross
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆24Updated 3 years ago
swarnaHub / SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆24Updated 2 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Updated 2 years ago
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 3 years ago
princeton-nlp / ShortcutGrammar
EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
☆57Updated 9 months ago
thunlp / Knowledge-Inheritance
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆38Updated 3 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆39Updated 2 years ago
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆11Updated 2 years ago
princeton-nlp / WhatICLLearns
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆20Updated 2 years ago
RUCAIBox / ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…
☆26Updated 3 years ago
pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆23Updated 8 months ago
sunlab-osu / ReasonBERT
Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021
☆29Updated 2 years ago
cooelf / CompassMTL
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Updated 3 years ago
frankaging / Causal-Distill
The Codebase for Causal Distillation for Language Models (NAACL '22)
☆25Updated 3 years ago
lyutyuh / structured-span-selector
A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…
☆21Updated 3 years ago
terarachang / DataICL
Data Valuation on In-Context Examples (ACL23)
☆24Updated 10 months ago
MikeWangWZHL / Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆16Updated 2 years ago
Tiiiger / templm
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Updated 3 years ago
txsun1997 / Metric-Fairness
EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
☆41Updated 3 years ago
yumeng5 / SuperGen
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
☆69Updated 3 years ago
MichaelZhouwang / Sequence_Span_Rewriting
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17Updated 4 years ago
gmftbyGMFTBY / MomentumDecoding
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19Updated 2 years ago
JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆66Updated 4 years ago
Chen-Wang-CUHK / Training-Free-and-Ref-Free-Summ-Evaluation
The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…
☆14Updated 2 years ago
neulab / retomaton
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
☆74Updated 3 years ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆29Updated 3 years ago
felixzli / synthetic_pretraining
☆38Updated 3 years ago
thunlp / DPT
☆13Updated 3 years ago
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆37Updated 2 years ago