mallorbc / gpt-j-6b
☆63 Updated 3 years ago
Alternatives and similar repositories for gpt-j-6b:
Users who are interested in gpt-j-6b are comparing it to the repositories listed below.
- ☆27 Updated 3 years ago
- Fine-tuning GPT-J-6B on Colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA) ☆74 Updated 2 years ago
- ☆121 Updated last year
- ☆34 Updated 3 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe… ☆437 Updated last year
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression ☆66 Updated 2 years ago
- A repository to run gpt-j-6b on low-VRAM machines (4.2 GB minimum VRAM for 2000 token context, 3.5 GB for 1000 token context). Model load… ☆115 Updated 3 years ago
- llama-4bit-colab ☆65 Updated last year
- Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B ☆64 Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch ☆110 Updated 2 years ago
- ☆65 Updated last year
- ☆128 Updated 2 years ago
- A ready-to-deploy container for implementing an easy-to-use REST API to access Language Models. ☆64 Updated 2 years ago
- Implementation of PersonaGPT Dialog Model ☆105 Updated 3 years ago
- Small repository for my video on LoRA ☆16 Updated last year
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. ☆48 Updated 2 years ago
- Notebook for running GPT-Neo models based on GPT-3 ☆63 Updated 3 years ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 Updated last year
- Reweight GPT - a simple neural network using transformer architecture for next character prediction ☆51 Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 Updated last year
- GPT2Explorer brings a playground for OpenAI's GPT2 language models to run locally on standard Windows computers. ☆29 Updated 2 years ago
- A GPT-J API to use with Python 3 to generate text, blogs, code, and more ☆206 Updated 2 years ago
- ☆28 Updated last year
- Creates a LangChain agent that relies on the WebUI's API and Wikipedia ☆73 Updated last year
- openai/whisper + extra features ☆88 Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- AI projects in Python, mostly Jupyter notebooks. ☆179 Updated 2 months ago
- ☆168 Updated last year
- A repo to store the Google Colaboratory notebooks that I have created and shared ☆274 Updated 7 months ago
- A Gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion. ☆308 Updated last year