paulcjh / gpt-j-6b
β50Updated last year
Related projects β
Alternatives and complementary repositories for gpt-j-6b
- Experiments with generating opensource language model assistantsβ97Updated last year
- π€Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.β55Updated 2 years ago
- β27Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets.β45Updated last year
- β31Updated last year
- β128Updated 2 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorchβ111Updated 2 years ago
- β91Updated last week
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instanceβ27Updated last year
- β110Updated 2 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inferenceβ34Updated 3 years ago
- β28Updated last year
- β46Updated this week
- β42Updated last year
- Adversarial Training and SFT for Bot Safety Modelsβ39Updated last year
- One stop shop for all things carpβ59Updated 2 years ago
- GPT-jax based on the official huggingface libraryβ13Updated 3 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.β164Updated 6 months ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.β46Updated 2 years ago
- Simple Python client for the Hugging Face Inference APIβ72Updated 4 years ago
- Prompt tuning toolkit for GPT-2 and GPT-Neoβ90Updated 3 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pileβ115Updated last year
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+β37Updated 3 years ago
- A dataset of alignment research and code to reproduce itβ69Updated last year
- β9Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engineβ31Updated 2 years ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"β27Updated last year