gustavecortal / gpt-j-fine-tuning-exampleLinks
Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression
β68Updated 3 years ago
Alternatives and similar repositories for gpt-j-fine-tuning-example
Users that are interested in gpt-j-fine-tuning-example are comparing it to the libraries listed below
Sorting:
- π€Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.β55Updated 3 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pileβ115Updated 2 years ago
- Experiments with generating opensource language model assistantsβ97Updated 2 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instanceβ28Updated 2 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorchβ110Updated 3 years ago
- β33Updated 2 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.β66Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limitβ62Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRAβ103Updated 6 months ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering varioβ¦β163Updated 2 years ago
- An experimental implementation of the retrieval-enhanced language modelβ75Updated 2 years ago
- β44Updated 2 years ago
- Framework agnostic python runtime for RWKV modelsβ146Updated 2 years ago
- β131Updated 3 years ago
- Reimplementation of the task generation part from the Alpaca paperβ118Updated 2 years ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.β49Updated 3 years ago
- Multi-Domain Expert Learningβ66Updated last year
- A library for squeakily cleaning and filtering language datasets.β48Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ79Updated last year
- 4 bits quantization of SantaCoder using GPTQβ51Updated 2 years ago
- Merge LLM that are split in to partsβ27Updated 3 months ago
- Instruct-tuning LLaMA on consumer hardwareβ65Updated 2 years ago
- β37Updated 2 years ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythiaβ40Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.β42Updated last year
- Prompt tuning toolkit for GPT-2 and GPT-Neoβ89Updated 4 years ago
- β253Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapperβ109Updated 2 years ago
- One stop shop for all things carpβ59Updated 3 years ago
- β35Updated 2 years ago