rinnakk / prefix-tuning-gpt
Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes
☆13 Updated 2 years ago
Alternatives and similar repositories for prefix-tuning-gpt
Users who are interested in prefix-tuning-gpt are comparing it to the libraries listed below.
- ☆46 Updated 3 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering ☆17 Updated 2 years ago
- Project of llm evaluation to Japanese tasks ☆90 Updated last week
- Convenient Text-to-Text Training for Transformers ☆19 Updated 3 years ago
- Checkpointable dataset utilities for foundation model training ☆31 Updated last year
- Support Continual pre-training & Instruction Tuning forked from llama-recipes ☆33 Updated last year
- ☆11 Updated 4 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark ☆20 Updated last year
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda ☆18 Updated last week
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆73 Updated last year
- ☆43 Updated 4 years ago
- ☆13 Updated 10 months ago
- Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism. ☆32 Updated 3 years ago
- ☆57 Updated 10 months ago
- Can LLMs generate code-mixed sentences through zero-shot prompting? ☆11 Updated 2 years ago
- ☆29 Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 3 years ago
- Do Multilingual Language Models Think Better in English? ☆42 Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆27 Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models. ☆10 Updated last year
- Ensembling Hugging Face transformers made easy ☆63 Updated 2 years ago
- ☆14 Updated 3 years ago
- ☆15 Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue ☆32 Updated 2 years ago
- Calculating Expected Time for training LLM. ☆38 Updated 2 years ago
- ☆44 Updated 10 months ago
- MEXMA: Token-level objectives improve sentence representations ☆41 Updated 9 months ago
- List of papers on Self-Correction of LLMs. ☆78 Updated 9 months ago
- Observe the slow deterioration of my mental sanity in the github commit history ☆12 Updated 2 years ago