paulcjh / gpt-j-6bLinks
β50Updated 2 years ago
Alternatives and similar repositories for gpt-j-6b
Users that are interested in gpt-j-6b are comparing it to the libraries listed below
Sorting:
- π€Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.β55Updated 3 years ago
- β131Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorchβ110Updated 3 years ago
- Smol but mighty language modelβ63Updated 2 years ago
- One stop shop for all things carpβ59Updated 3 years ago
- Simple Python client for the Hugging Face Inference APIβ75Updated 5 years ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β67Updated 2 years ago
- β33Updated 2 years ago
- Developing tools to automatically analyze datasetsβ75Updated 11 months ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"β27Updated 2 years ago
- Open source library for few shot NLPβ79Updated 2 years ago
- β43Updated 2 years ago
- Experiments with generating opensource language model assistantsβ97Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.β169Updated last week
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 qβ¦β89Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scaleβ156Updated last year
- A library for squeakily cleaning and filtering language datasets.β47Updated 2 years ago
- β113Updated 3 years ago
- Evaluation suite for large-scale language models.β128Updated 4 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.β67Updated 3 years ago
- β33Updated 2 years ago
- Training a model without a dataset for natural language inference (NLI)β25Updated 5 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engineβ31Updated 3 years ago
- GPT-jax based on the official huggingface libraryβ13Updated 4 years ago
- β91Updated 3 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instanceβ28Updated 2 years ago
- A diff tool for language modelsβ44Updated last year
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.β17Updated 2 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inferenceβ35Updated 4 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogsβ115Updated 2 years ago