TheProtaganist / gpt-j
A GPT-J API to use with python3 to generate text, blogs, code, and more
☆205Updated 2 years ago
Alternatives and similar repositories for gpt-j
Users that are interested in gpt-j are comparing it to the libraries listed below
Sorting:
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend☆337Updated 3 years ago
- A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load…☆115Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆56Updated 3 years ago
- A ready-to-deploy container for implementing an easy to use REST API to access Language Models.☆64Updated 2 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆433Updated 3 years ago
- A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)☆36Updated 3 years ago
- Notebook for running GPT neo models based on GPT3☆63Updated 3 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆437Updated last year
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.☆49Updated 2 years ago
- ☆130Updated 2 years ago
- Tweet Generation with Huggingface☆429Updated last year
- Code Generation using GPT-J!☆518Updated 2 years ago
- A client for OpenAI's GPT-3 API for ad hoc testing of prompt without using the web interface.☆90Updated 4 years ago
- ☆62Updated 3 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 3 years ago
- A search engine for ParlAI's BlenderBot project (and probably other ones as well)☆131Updated 3 years ago
- A python code to interact with the GPT3 API to train the chatbot and use it.☆111Updated 4 years ago
- ELIZA is an open domain chatbot with Discord and Twitter integration.☆73Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆167Updated last month
- ☆34Updated 3 years ago
- ☆9Updated 3 years ago
- ☆49Updated 2 years ago
- Code Generation and Search for Python☆53Updated 3 years ago
- Google's Meena transformer chatbot implementation☆105Updated 3 years ago
- Dalle service☆50Updated 3 years ago
- Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)☆74Updated 2 years ago
- ☆33Updated last year
- llama-4bit-colab☆65Updated 2 years ago
- a simple bot that allows you to chat with various personas☆71Updated 4 years ago