rinnakk / prefix-tuning-gpt
Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes
☆11Updated last year
Related projects: ⓘ
- ☆46Updated 2 years ago
- Codes to pre-train Japanese T5 models☆40Updated 3 years ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆31Updated 7 months ago
- ☆43Updated 3 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆17Updated last year
- ☆28Updated 3 months ago
- COMET-ATOMIC ja☆28Updated 6 months ago
- ☆11Updated 3 years ago
- Flexible evaluation tool for language models☆27Updated this week
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆21Updated 4 months ago
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆16Updated 3 years ago
- Do Multilingual Language Models Think Better in English?☆41Updated last year
- ☆10Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆73Updated 5 months ago
- ☆16Updated 10 months ago
- ☆11Updated 3 months ago
- ☆41Updated last year
- ☆28Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆69Updated 6 months ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Updated 10 months ago
- Observe the slow deterioration of my mental sanity in the github commit history☆13Updated last year
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆25Updated 6 months ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆21Updated last year
- Checkpointable dataset utilities for foundation model training☆31Updated 7 months ago
- Code for COLING 2020 Paper☆13Updated 2 weeks ago
- ☆40Updated 7 months ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆53Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Updated 3 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆38Updated 8 months ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆31Updated last year