princeton-nlp / DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration
★114 · Updated 2 years ago
Alternatives and similar repositories for DinkyTrain:
- ★48 · Updated 11 months ago
- DEMix Layers for Modular Language Modeling ★53 · Updated 3 years ago
- ★85 · Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention ★66 · Updated last year
- ★30 · Updated last year
- reStructured Pre-training ★98 · Updated 2 years ago
- Code for the ACL 2023 paper "Pre-Training to Learn in Context" ★108 · Updated 8 months ago
- ★61 · Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper "CoNT: Contrastive Neural Text Generation" ★152 · Updated last year
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li… ★21 · Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning" ★64 · Updated last year
- ★116 · Updated 2 years ago
- A unified benchmark for math reasoning ★87 · Updated 2 years ago
- The official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ★100 · Updated 2 years ago
- Code for the ACL 2023 paper "Lifting the Curse of Capacity Gap in Distilling Language Models" ★28 · Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ★98 · Updated last year
- Code for "Editing Factual Knowledge in Language Models" ★136 · Updated 3 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ★77 · Updated last year
- Code for the ACL 2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts" ★45 · Updated 2 years ago
- FairSeq repo with the Apollo optimizer ★112 · Updated last year
- ★78 · Updated 2 years ago
- Contrastive decoding ★197 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models ★103 · Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ★64 · Updated 2 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences" ★69 · Updated last year
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction (https://arxiv.org/abs/2210.11560) ★58 · Updated last month
- Code for the AAAI 2021 paper "A Theoretical Analysis of the Repetition Problem in Text Generation" ★52 · Updated 2 years ago
- Implementation of the ICML 2023 paper "Specializing Smaller Language Models towards Multi-Step Reasoning" ★130 · Updated last year
- Automatic metrics for GEM tasks ★65 · Updated 2 years ago
- Code for the paper "Data-Efficient FineTuning" ★29 · Updated last year