☆50Jan 4, 2023Updated 3 years ago
Alternatives and similar repositories for gpt-j-6b
Users that are interested in gpt-j-6b are comparing it to the libraries listed below.
- ☆34Aug 10, 2021Updated 4 years ago
- 🤗Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.☆56Jan 20, 2022Updated 4 years ago
- A repo for code based language models☆18Feb 10, 2021Updated 5 years ago
- Notebook for running GPT-Neo models based on GPT-3☆61Aug 10, 2021Updated 4 years ago
- ☆58Apr 28, 2021Updated 5 years ago
- ☆26Aug 10, 2021Updated 4 years ago
- A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load…☆112Dec 23, 2021Updated 4 years ago
- Just a repo with some AI Dungeon scripts☆31Jul 4, 2021Updated 4 years ago
- Hyper protocol + IPNS = HyPNS☆17Jul 27, 2022Updated 3 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend☆335Oct 25, 2021Updated 4 years ago
- ☆25Jul 20, 2025Updated 10 months ago
- Grounding Language Models for Compositional and Spatial Reasoning☆18Oct 26, 2022Updated 3 years ago
- Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)☆73Jun 18, 2022Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Nov 25, 2025Updated 6 months ago
- Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Ex…☆17Apr 1, 2021Updated 5 years ago
- A tool for generic tracking-based CV annotation☆18Jan 27, 2021Updated 5 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- A simple library that implements CLIP guided loss in PyTorch.☆77Dec 25, 2021Updated 4 years ago
- Named Entity Recognition via Attention-based CNNs-BiLSTM-CRF☆15Jun 27, 2018Updated 7 years ago
- generates images. easy to run script plus setup instructions for craiyon (formerly dalle-mini) meant for the mega model for low vram devi…☆11Jun 28, 2022Updated 3 years ago
- Convert a username/group name to a uid/gid number☆18Oct 8, 2015Updated 10 years ago
- Implementation of Siamese CBOW using keras whose backend is tensorflow.☆12Feb 2, 2023Updated 3 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Jun 26, 2021Updated 4 years ago
- ☆15Dec 20, 2020Updated 5 years ago
- Model parallel transformers in JAX and Haiku☆6,370Jan 21, 2023Updated 3 years ago
- QLoRA for Masked Language Modeling☆24Sep 11, 2023Updated 2 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Nov 12, 2022Updated 3 years ago
- ☆12Jun 19, 2025Updated 11 months ago
- EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), framework for evaluating quantitative reasoning ability in…☆14Feb 13, 2022Updated 4 years ago
- Source code of the paper "Prediction of Molecular Absorption Wavelength Using Deep Neural Networks"☆10May 29, 2022Updated 4 years ago
- Repo for fine-tuning Causal LLMs☆464Mar 27, 2024Updated 2 years ago
- A text generation Transformer model trained on Reddit posts.☆16Jan 5, 2023Updated 3 years ago
- ☆33Apr 23, 2023Updated 3 years ago
- Data structure that maps entries to numeric ids☆14Aug 16, 2015Updated 10 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆147Nov 23, 2022Updated 3 years ago
- a repository containing the details of natural language inference dataset in Hindi☆14Dec 28, 2020Updated 5 years ago
- 📦 A collection of pastable code gathered from past projects☆12Sep 9, 2024Updated last year
- Generate vector graphics from a textual caption☆120Jul 2, 2021Updated 4 years ago