paulcjh / gpt-j-6bLinks

☆50

Alternatives and similar repositories for gpt-j-6b

Users that are interested in gpt-j-6b are comparing it to the libraries listed below

Sorting:

labmlai / neox
Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Updated 2 years ago
leogao2 / commoncrawl_downloader
☆33Updated 2 years ago
nickthorpie / gpt-j-simple
☆9Updated 4 years ago
EleutherAI / openwebtext2
☆90Updated 3 years ago
huggingface / hfapi
Simple Python client for the Hugging Face Inference API
☆74Updated 4 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆155Updated last year
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆59Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
☆32Updated 2 years ago
finetunej / transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
☆56Updated 3 years ago
SALT-NLP / Bound-Cap-LLM
Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"
☆27Updated 2 years ago
dreasysnail / RetGen
☆112Updated 2 years ago
DOUDOU0314 / GPT-J-hf
GPT-jax based on the official huggingface library
☆13Updated 4 years ago
zphang / minimal-gpt-neox-20b
☆130Updated 3 years ago
pszemraj / ai-msgbot
Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.
☆48Updated 2 years ago
EleutherAI / DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
☆168Updated 2 weeks ago
AI21Labs / lm-evaluation
Evaluation suite for large-scale language models.
☆127Updated 3 years ago
curai / curai-research
☆94Updated 7 months ago
krandiash / gpt3-nli
Training a model without a dataset for natural language inference (NLI)
☆25Updated 5 years ago
LAION-AI / interesting-text-datasets
☆43Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
VE-FORBRYDERNE / mtj-softtuner
Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance
☆28Updated 2 years ago
basusourya / mirostat
Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).
☆60Updated 3 years ago
salesforce / TaiChi
Open source library for few shot NLP
☆78Updated 2 years ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆311Updated 2 years ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
huggingface / data-measurements-tool
Developing tools to automatically analyze datasets
☆74Updated 9 months ago
EleutherAI / lm_perplexity
☆153Updated 4 years ago
radi-cho / RSTOD
Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.
☆17Updated 2 years ago
BlinkDL / RWKV-v2-RNN-Pile
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
☆67Updated 2 years ago