A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.
β112Dec 23, 2021Updated 4 years ago
Alternatives and similar repositories for Basic-UI-for-GPT-J-6B-with-low-vram
Users that are interested in Basic-UI-for-GPT-J-6B-with-low-vram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)β35Jun 14, 2021Updated 4 years ago
- π€Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.β56Jan 20, 2022Updated 4 years ago
- Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7Bβ62Jul 21, 2021Updated 4 years ago
- Tools with GUI for GPT finetune data preparationβ22Aug 24, 2021Updated 4 years ago
- API for the GPT-J language model π¦. Including a FastAPI backend and a streamlit frontendβ335Oct 25, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An attempt to create an open-source AI companion that is self-hostableβ83Nov 28, 2022Updated 3 years ago
- β27May 11, 2023Updated 3 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inferenceβ35Jun 26, 2021Updated 4 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and moreβ204Nov 12, 2022Updated 3 years ago
- Notebook for running GPT neo models based on GPT3β61Aug 10, 2021Updated 4 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instanceβ28Mar 1, 2023Updated 3 years ago
- β50Jan 4, 2023Updated 3 years ago
- Just a repo with some AI Dungeon scriptsβ31Jul 4, 2021Updated 4 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeβ¦β436Jun 14, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Model parallel transformers in JAX and Haikuβ6,370Jan 21, 2023Updated 3 years ago
- A repo for code based language modelsβ18Feb 10, 2021Updated 5 years ago
- AI Dungeon Catalog Archive Toolkitβ35Jul 19, 2021Updated 4 years ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compressionβ69Oct 5, 2022Updated 3 years ago
- rwkv_chatbotβ62Feb 6, 2023Updated 3 years ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusionβ12Nov 6, 2022Updated 3 years ago
- A VR character controller for A-Frame with teleportation, smooth locomotion, snap turning, and smooth turning.β10Aug 12, 2023Updated 2 years ago
- How to build your own GPT-J Playgroundβ32May 4, 2022Updated 4 years ago
- Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.β181Aug 5, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed librariesβ7,430May 19, 2026Updated last week
- Converts stable diffusion embeddings to loadable pngsβ40Dec 6, 2022Updated 3 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at thisβ¦β21Mar 16, 2023Updated 3 years ago
- β14Jan 14, 2024Updated 2 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated textsβ53Jun 13, 2021Updated 4 years ago
- Small python project to generate movies with a droste effectβ27Dec 26, 2022Updated 3 years ago
- A robust Python tool for text-based AI training and generation using GPT-2.β1,841Jul 14, 2023Updated 2 years ago
- Turns KoboldAI into a crowdsourced distributed clusterβ33Oct 19, 2023Updated 2 years ago
- configuration browser for the sd2iec floppy drive emulator for C64β11Dec 14, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Extension for stable diffusion webui to add advance prompt tuningβ10Nov 13, 2022Updated 3 years ago
- β56Mar 2, 2023Updated 3 years ago
- For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcppβ3,902Jan 16, 2025Updated last year
- β22Oct 8, 2022Updated 3 years ago
- This will help you convert a GPT2-XL model to an optimized onnx model fp 16.β10Oct 13, 2020Updated 5 years ago
- Prompt tuning toolkit for GPT-2 and GPT-Neoβ90Sep 27, 2021Updated 4 years ago
- 1.4B latent diffusion model fine tuningβ265May 16, 2022Updated 4 years ago