arrmansa / Basic-UI-for-GPT-J-6B-with-low-vram
A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.
☆114Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Basic-UI-for-GPT-J-6B-with-low-vram
- Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B☆64Updated 3 years ago
- A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)☆35Updated 3 years ago
- Just a repo with some AI Dungeon scripts☆29Updated 3 years ago
- Tools with GUI for GPT finetune data preparation☆23Updated 3 years ago
- Conversational Language model toolkit for training against human preferences.☆41Updated 7 months ago
- A ready-to-deploy container for implementing an easy to use REST API to access Language Models.☆64Updated last year
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆55Updated 2 years ago
- ☆28Updated last year
- k_diffusion wrapper included for k_lms sampling. fixed for notebook.☆20Updated last year
- NovelAI Research Tool and API implementations in Golang☆43Updated 2 years ago
- A latent text-to-image diffusion model☆67Updated last year
- 1.4B latent diffusion model fine tuning☆261Updated 2 years ago
- A prompt/context management system☆165Updated last year
- Simple Annotated implementation of GPT-NeoX in PyTorch☆111Updated 2 years ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆307Updated last year
- Frontend for deeplearning Image generation☆144Updated 5 months ago
- Notebook for running GPT neo models based on GPT3☆64Updated 3 years ago
- ☆150Updated last year
- extending stable diffusion prompts with suitable style cues using text generation☆177Updated last year
- ☆128Updated 2 years ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆27Updated last year
- ELIZA is an open domain chatbot with Discord and Twitter integration.☆70Updated last year
- stable diffusion training☆291Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- Platform and API Agnostic library for powering chatbots☆24Updated last year