salesforce / MPTLinks
☆16Updated last year
Alternatives and similar repositories for MPT
Users that are interested in MPT are comparing it to the libraries listed below
Sorting:
- Code for paper 'Data-Efficient FineTuning'☆29Updated 2 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated last year
- ☆11Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 3 months ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Updated 4 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated last year
- ☆54Updated 2 years ago
- ☆15Updated 3 years ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆27Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated 11 months ago
- ☆34Updated 11 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago
- This is the code for the Submission 3358 at NeurIPS 2022.☆22Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated 2 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Updated 2 years ago
- Efficient Memory-Augmented Transformers☆34Updated 2 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆25Updated last year
- ☆35Updated last year
- Towards Systematic Measurement for Long Text Quality☆35Updated 9 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆57Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year
- ☆35Updated last year
- DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)☆50Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Updated 7 months ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Updated last year