sleekmike / Finetune_GPT-J_6B_8-bitLinks

Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)

☆74

Alternatives and similar repositories for Finetune_GPT-J_6B_8-bit

Users that are interested in Finetune_GPT-J_6B_8-bit are comparing it to the libraries listed below

Sorting:

mallorbc / Finetune_LLMs
Repo for fine-tuning Casual LLMs
☆456Updated last year
labmlai / neox
Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Updated 3 years ago
zphang / minimal-gpt-neox-20b
☆131Updated 3 years ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆307Updated 2 years ago
Xirider / finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…
☆436Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103Updated 5 months ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆118Updated 2 years ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆464Updated 2 years ago
AlekseyKorshuk / huggingnft
Generate NFT or train new model in just few clicks! Train as much as you can, others will resume from checkpoint!
☆157Updated 3 years ago
pszemraj / ai-msgbot
Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.
☆49Updated 3 years ago
amrrs / LLM-QA-Bot
☆64Updated 2 years ago
gustavecortal / gpt-j-fine-tuning-example
Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression
☆68Updated 3 years ago
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆237Updated 2 years ago
aspctu / alpaca-lora
Instruct-tuning LLaMA on consumer hardware
☆65Updated 2 years ago
paulcjh / gpt-j-6b
☆50Updated 2 years ago
mayooear / private-chatbot-mpt30b-langchain
Chat with your data privately using MPT-30b
☆183Updated 2 years ago
jagilley / fact-checker
Fact-checking LLM outputs with self-ask
☆304Updated 2 years ago
ConiferLabsWA / flan-ul2-dolly
☆34Updated 2 years ago
mallorbc / gpt-j-6b
☆64Updated 4 years ago
google-research-datasets / presto
A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
☆115Updated 2 years ago
keerthanpg / DadJokeGenerator
☆46Updated 2 years ago
radi-cho / botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…
☆164Updated 2 years ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
cohere-ai / sandbox-conversant-lib
Conversational AI tooling & personas built on Cohere's LLMs
☆174Updated 2 years ago
chainyo / picaisso
🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.
☆50Updated 2 years ago
vicgalle / gpt-j-api
API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend
☆336Updated 4 years ago
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆58Updated 2 years ago
TheProtaganist / gpt-j
A GPT-J API to use with python3 to generate text, blogs, code, and more
☆204Updated 2 years ago
huggingface / fuego
[WIP] A 🔥 interface for running code in the cloud
☆85Updated 2 years ago