rinnakk / prefix-tuning-gpt
Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes
☆12Updated 2 years ago
Alternatives and similar repositories for prefix-tuning-gpt
Users that are interested in prefix-tuning-gpt are comparing it to the libraries listed below
- ☆46Updated 3 years ago
- Supports continual pre-training and instruction tuning; forked from llama-recipes☆32Updated last year
- Checkpointable dataset utilities for foundation model training☆32Updated last year
- ☆11Updated 3 years ago
- ☆17Updated 6 months ago
- ☆53Updated 6 months ago
- KETOD: Knowledge-Enriched Task-Oriented Dialogue☆32Updated 2 years ago
- ☆29Updated 3 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆16Updated 2 years ago
- Code to pre-train Japanese T5 models☆41Updated 3 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Convenient Text-to-Text Training for Transformers☆19Updated 3 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆15Updated 7 months ago
- InstructIR, a novel benchmark specifically designed to evaluate the instruction-following ability of information retrieval models. Our foc…☆32Updated last year
- Codebase for public release of the plug-and-blend framework.☆23Updated 3 years ago
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆16Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year
- Utility scripts for preprocessing Wikipedia texts for NLP☆77Updated last year
- Japanese LLaMa experiment☆53Updated 6 months ago
- Japanese Massive Multitask Language Understanding Benchmark☆36Updated 6 months ago
- ☆43Updated 3 years ago
- COMET-ATOMIC ja☆30Updated last year
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆52Updated 2 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated 2 years ago
- ☆15Updated 3 years ago
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆21Updated last year
- List of papers on Self-Correction of LLMs.☆73Updated 6 months ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆21Updated 3 months ago