salesforce / MPTLinks
☆16Updated 2 years ago
Alternatives and similar repositories for MPT
Users that are interested in MPT are comparing it to the libraries listed below
Sorting:
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last year
- ☆11Updated 3 years ago
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202 …☆48Updated 3 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆100Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆65Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆61Updated last year
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆35Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Updated 3 years ago
- Retrieval as Attention☆82Updated 3 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated 2 years ago
- Transformers at any scale☆42Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 3 years ago
- ☆75Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆55Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 2 years ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Updated 2 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Updated 2 years ago
- ☆35Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆190Updated 11 months ago
- ☆15Updated 4 years ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆76Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- ROUGE for multilingual Summarization☆25Updated 4 years ago
- ☆35Updated 2 years ago