rinnakk / prefix-tuning-gpt
Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for prefix-tuning-gpt
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆10Updated 4 months ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆16Updated last year
- Checkpointable dataset utilities for foundation model training☆32Updated 9 months ago
- ☆46Updated 2 years ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆32Updated 9 months ago
- ☆43Updated 3 years ago
- Codes to pre-train Japanese T5 models☆40Updated 3 years ago
- A library for semantic similarity search☆23Updated 2 months ago
- Observe the slow deterioration of my mental sanity in the github commit history☆13Updated last year
- ☆28Updated 2 years ago
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆16Updated 3 years ago
- Flexible evaluation tool for language models☆36Updated this week
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆21Updated 6 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- ☆16Updated last year
- ☆11Updated 6 months ago
- ☆11Updated 3 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆76Updated 7 months ago
- Hugging Face RoBERTa with Flash Attention 2☆19Updated last year
- Convenient Text-to-Text Training for Transformers☆19Updated 2 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆31Updated last year
- ☆51Updated 5 months ago
- ☆25Updated 5 months ago
- ☆12Updated 5 months ago
- Repository for Skill Set Optimization☆12Updated 3 months ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated 10 months ago
- COMET-ATOMIC ja☆28Updated 8 months ago
- Calculating Expected Time for training LLM.☆38Updated last year