salesforce / MPT
☆16Updated 2 years ago
Alternatives and similar repositories for MPT
Users that are interested in MPT are comparing it to the libraries listed below
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing☆100Updated last year
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Updated last year
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆28Updated 2 years ago
- ☆15Updated 4 years ago
- This project maintains a reading list for general text generation tasks☆66Updated 4 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Retrieval as Attention☆82Updated 2 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆35Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆58Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Updated 4 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 3 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Updated 3 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Updated 2 years ago
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction"☆54Updated last year
- ☆11Updated 3 years ago
- Transformers at any scale☆42Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆55Updated last year
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202…☆47Updated 3 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 3 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 2 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆26Updated 2 years ago
- An extension of the Transformers library that adds a T5ForSequenceClassification class.☆40Updated 2 years ago