openai / gpt-3
GPT-3: Language Models are Few-Shot Learners
☆15,684 Updated 4 years ago
Related projects
Alternatives and complementary repositories for gpt-3
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆22,489 Updated 2 months ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆6,928 Updated this week
- TensorFlow code and pre-trained models for BERT ☆38,156 Updated 3 months ago
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. ☆8,233 Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆35,365 Updated this week
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,160 Updated last month
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆20,122 Updated last week
- Dataset of GPT-2 outputs for research in detection, biases, and more ☆1,939 Updated 10 months ago
- Ongoing research training transformer models at scale ☆10,480 Updated this week
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆30,128 Updated 2 weeks ago
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production ☆9,038 Updated this week
- Library for fast text representation and classification. ☆25,928 Updated 7 months ago
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations ☆3,245 Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,186 Updated last week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆30,468 Updated 3 weeks ago
- Unsupervised text tokenizer for Neural Network-based text generation. ☆10,252 Updated last week
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ☆134,531 Updated this week
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,181 Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. ☆26,052 Updated this week
- Inference code for Llama models ☆56,320 Updated 2 months ago
- Model parallel transformers in JAX and Haiku ☆6,291 Updated last year
- ☆4,587 Updated last year
- A library for efficient similarity search and clustering of dense vectors. ☆31,320 Updated this week
- An open-source NLP research library, built on PyTorch. ☆11,756 Updated last year
- Repo for external large-scale work ☆6,513 Updated 6 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,247 Updated 2 months ago
- Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI t… ☆15,473 Updated last month
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,700 Updated 9 months ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆4,931 Updated 7 months ago