mallorbc / gpt-j-6b
☆63 Updated 3 years ago
Related projects
Alternatives and complementary repositories for gpt-j-6b
- Repo for fine-tuning Causal LLMs ☆449 Updated 7 months ago
- ☆33 Updated 3 years ago
- ☆27 Updated 3 years ago
- Notebook for running GPT neo models based on GPT3 ☆64 Updated 3 years ago
- Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA) ☆74 Updated 2 years ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression ☆65 Updated 2 years ago
- ☆121 Updated last year
- Simple Annotated implementation of GPT-NeoX in PyTorch ☆111 Updated 2 years ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. ☆46 Updated 2 years ago
- A repository to run GPT-J-6B on low-VRAM machines (4.2 GB minimum VRAM for a 2000-token context, 3.5 GB for a 1000-token context). Model load… ☆114 Updated 2 years ago
- ☆128 Updated 2 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe… ☆432 Updated last year
- Patch for MPT-7B which allows using and training a LoRA ☆58 Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆31 Updated last year
- llama-4bit-colab ☆64 Updated last year
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0. ☆55 Updated 2 years ago
- ☆28 Updated last year
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend ☆338 Updated 3 years ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆348 Updated last year
- A GPT-J API to use with python3 to generate text, blogs, code, and more ☆204 Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 Updated last year
- A basic UI for running GPT-Neo 2.7B on low VRAM (3 GB VRAM minimum) ☆35 Updated 3 years ago
- A ready-to-deploy container for implementing an easy to use REST API to access Language Models. ☆64 Updated last year
- Reweight GPT - a simple neural network using transformer architecture for next character prediction ☆48 Updated last year
- Conversational Language model toolkit for training against human preferences. ☆41 Updated 7 months ago
- A search engine for ParlAI's BlenderBot project (and probably other ones as well) ☆132 Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 Updated last year
- Implementation of PersonaGPT Dialog Model ☆102 Updated 3 years ago