gustavecortal / gpt-j-fine-tuning-example

Fine-tuning the 6-billion-parameter GPT-J (and other models) with LoRA and 8-bit compression

☆66

Alternatives and similar repositories for gpt-j-fine-tuning-example

Users who are interested in gpt-j-fine-tuning-example compare it to the repositories listed below.

VE-FORBRYDERNE / mtj-softtuner
Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance
☆28 Updated 2 years ago
finetunej / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
☆56 Updated 3 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning a RoPE model on sequences longer than those used in pre-training adapts the model's context limit
☆63 Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating open-source language model assistants
☆97 Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filtered 8K sequences from the Pile
☆116 Updated 2 years ago
BlinkDL / RWKV-v2-RNN-Pile
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
☆67 Updated 2 years ago
labmlai / neox
Simple annotated implementation of GPT-NeoX in PyTorch
☆110 Updated 2 years ago
zphang / minimal-gpt-neox-20b
☆130 Updated 3 years ago
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67 Updated last year
deep-diver / LLM-Pref-Mark-UI
☆37 Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
☆32 Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104 Updated 2 months ago
ChrisHayduk / qlora-multi-gpu
QLoRA with enhanced multi-GPU support
☆37 Updated 2 years ago
rmihaylov / mpttune
Tune MPTs
☆84 Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework-agnostic Python runtime for RWKV models
☆146 Updated last year
ConiferLabsWA / flan-ul2-dolly
☆34 Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32 Updated 2 years ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ compression of LLMs like LLaMA, OPT, Pythia
☆41 Updated 2 years ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119 Updated 2 years ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47 Updated 2 years ago
pszemraj / ai-msgbot
Training and implementation of chatbots that use GPT-like architectures with the aitextgen package to enable dynamic conversations.
☆48 Updated 2 years ago
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110 Updated 2 years ago
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆103 Updated 2 years ago
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆58 Updated 2 years ago
donaldafeith / Pytorch_Merge
Merge LLMs that are split into parts
☆27 Updated 2 weeks ago
zarakiquemparte / zaraki-tools
☆27 Updated last year
AlpinDale / sparsegpt-for-LLaMA
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71 Updated 2 years ago
illidanlab / personaGPT
Implementation of PersonaGPT Dialog Model
☆113 Updated 3 years ago
EleutherAI / openwebtext2
☆90 Updated 3 years ago
TehVenomm / LM_Transformers_BlockMerge
Image-diffusion block-merging technique applied to transformer-based language models.
☆54 Updated 2 years ago