princeton-nlp / DinkyTrainView external linksLinks
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration π
β116Oct 27, 2022Updated 3 years ago
Alternatives and similar repositories for DinkyTrain
Users that are interested in DinkyTrain are comparing it to the libraries listed below
Sorting:
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240β168Oct 7, 2022Updated 3 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674β195Jun 14, 2023Updated 2 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560β58Feb 28, 2025Updated 11 months ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learningβ12Aug 23, 2025Updated 5 months ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408β198May 9, 2023Updated 2 years ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processingβ286Aug 8, 2022Updated 3 years ago
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443β86Sep 15, 2024Updated last year
- Fusion-in-Decoderβ591Oct 4, 2023Updated 2 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"β131Apr 23, 2022Updated 3 years ago
- β54Apr 15, 2022Updated 3 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learningβ94Jun 8, 2022Updated 3 years ago
- An (incomplete) overview of information extractionβ43Apr 28, 2022Updated 3 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answeringβ44Jun 18, 2022Updated 3 years ago
- β10Sep 27, 2021Updated 4 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yaβ¦β142Dec 30, 2021Updated 4 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semantiβ¦β21Jul 11, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxβ¦β137Aug 2, 2023Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β165Oct 4, 2023Updated 2 years ago
- β35May 18, 2023Updated 2 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643β78Sep 4, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuningβ98Apr 26, 2023Updated 2 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Modelsβ12Jul 1, 2023Updated 2 years ago
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)β12Feb 28, 2022Updated 3 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)β1,040Sep 19, 2024Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tablesβ24Aug 22, 2022Updated 3 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentationβ24May 1, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generationβ475Mar 7, 2024Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pβ¦β433Aug 17, 2022Updated 3 years ago
- Lite Self-Trainingβ30Jul 25, 2023Updated 2 years ago
- Scalable training for dense retrieval models.β298Jun 10, 2025Updated 8 months ago
- β59Sep 23, 2022Updated 3 years ago
- Source code for SIGIR 2022 paper.β16Apr 25, 2022Updated 3 years ago
- Library for Knowledge Intensive Language Tasksβ963Mar 31, 2022Updated 3 years ago
- β60Dec 20, 2022Updated 3 years ago
- β98Jun 6, 2022Updated 3 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"β296Oct 27, 2022Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewritingβ17Nov 30, 2021Updated 4 years ago
- Zero-shot Learning by Generating Task-specific Adaptersβ14Apr 2, 2021Updated 4 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790β27Nov 21, 2022Updated 3 years ago