rinnakk / prefix-tuning-gpt
Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes
☆13 Updated 2 years ago
Alternatives and similar repositories for prefix-tuning-gpt
Users who are interested in prefix-tuning-gpt are comparing it to the repositories listed below
- ☆46 Updated 3 years ago
- ☆43 Updated 4 years ago
- Checkpointable dataset utilities for foundation model training ☆32 Updated last year
- Support Continual pre-training & Instruction Tuning forked from llama-recipes ☆32 Updated last year
- ☆42 Updated last year
- COMET-ATOMIC ja ☆30 Updated last year
- Project of llm evaluation to Japanese tasks ☆89 Updated last week
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering ☆16 Updated 2 years ago
- ☆50 Updated last year
- Repo for "Smart Word Suggestions" (SWS) task and benchmark ☆20 Updated last year
- Codes to pre-train Japanese T5 models ☆40 Updated 3 years ago
- ☆11 Updated 4 years ago
- ☆18 Updated 8 months ago
- ☆14 Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated 2 years ago
- Japanese LLaMa experiment ☆54 Updated 8 months ago
- ☆61 Updated last year
- Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism. ☆32 Updated 3 years ago
- ☆56 Updated 8 months ago
- ☆28 Updated 4 months ago
- ☆15 Updated 3 years ago
- Convenient Text-to-Text Training for Transformers ☆19 Updated 3 years ago
- Observe the slow deterioration of my mental sanity in the github commit history ☆12 Updated 2 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo… ☆39 Updated last year
- official repo for AAAI ALOHA chatbot ☆29 Updated last year
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus ☆16 Updated 4 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 3 years ago
- GPT-jax based on the official huggingface library ☆13 Updated 4 years ago
- ☆29 Updated 3 years ago
- hllama is a library which aims to provide a set of utility tools for large language models. ☆10 Updated last year