rish-16 / gpt3-pytorch
Unofficial PyTorch Implementation of OpenAI's GPT-3
☆13 · Updated 3 years ago
Alternatives and similar repositories for gpt3-pytorch
Users interested in gpt3-pytorch are comparing it to the libraries listed below.
- A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆120 · Updated 4 years ago
- Closed-form solutions for logistic regression and single-layer softmax ☆12 · Updated 4 years ago
- Code for the paper "Deformable Butterfly: A Highly Structured and Sparse Linear Transform" ☆16 · Updated 4 years ago
- Large-scale distributed model training strategies with Colossal AI and Lightning AI ☆56 · Updated 2 years ago
- (ACL 2022) The source code for the paper "Towards Abstractive Grounded Summarization of Podcast Transcripts" ☆17 · Updated 2 years ago
- ALBERT for the Conversational Question Answering Challenge ☆22 · Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers" ☆51 · Updated 4 years ago
- Implementation of Token Shift GPT - an autoregressive model that relies solely on shifting the sequence space for mixing (see the first sketch after this list) ☆49 · Updated 4 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia by Summarizing Long Sequences" ☆70 · Updated 2 years ago
- A translation task using TurboTransformers ☆11 · Updated 5 years ago
- ☆24 · Updated 3 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention ☆49 · Updated 5 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in PyTorch ☆76 · Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 (see the second sketch after this list) ☆49 · Updated 3 years ago
- ☆27 · Updated 6 months ago
- Fine-tune CPM-1 ☆24 · Updated 4 years ago
- GPT-2 fine-tuning with Transformers 🤗 ☆28 · Updated 5 years ago
- Implementation of Multistream Transformers in PyTorch ☆54 · Updated 4 years ago
- My explorations into editing the knowledge and memories of an attention network ☆35 · Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding ☆41 · Updated 4 years ago
- Local Attention - Flax module for JAX ☆22 · Updated 4 years ago
- Emotion-aware dialogue response generation by multi-task learning ☆13 · Updated 4 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch ☆17 · Updated 3 years ago
- Transformers at any scale ☆42 · Updated 2 years ago
- Code associated with the paper "SkipBERT: Efficient Inference with Shallow Layer Skipping", at ACL 2022 ☆16 · Updated 3 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in PyTorch ☆46 · Updated 4 years ago
- An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library ☆21 · Updated 3 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism ☆30 · Updated last month
- ☆52 · Updated 3 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra… ☆33 · Updated 4 years ago
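Two of the listed ideas are compact enough to illustrate inline. Token Shift GPT mixes information across positions without attention by shifting part of the feature channels one step back along the sequence. Below is a minimal sketch of that mixing step, written from the one-line description above rather than taken from the linked repository:

```python
import torch
import torch.nn.functional as F

def token_shift(x: torch.Tensor) -> torch.Tensor:
    # x: (batch, seq_len, dim). Split the channels in half and shift one
    # half a single step back in time, so position t also sees features
    # from position t - 1 (causal: no information from future tokens).
    x_shift, x_keep = x.chunk(2, dim=-1)
    x_shift = F.pad(x_shift, (0, 0, 1, -1))  # pad the front, trim the last step
    return torch.cat((x_shift, x_keep), dim=-1)

x = torch.randn(2, 16, 64)
assert token_shift(x).shape == (2, 16, 64)
```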
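Similarly, ReLA (Rectified Linear Attention, https://arxiv.org/abs/2104.07012) replaces the softmax over attention scores with a ReLU, which makes the attention weights naturally sparse. The paper additionally stabilizes training with a gated RMSNorm on the attention output, which this simplified sketch omits:

```python
import torch
import torch.nn.functional as F

def rela_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    # q, k, v: (batch, heads, seq_len, head_dim)
    scale = q.shape[-1] ** -0.5
    scores = torch.einsum('bhid,bhjd->bhij', q, k) * scale
    weights = F.relu(scores)  # sparse, unnormalized attention weights
    return torch.einsum('bhij,bhjd->bhid', weights, v)

q = k = v = torch.randn(1, 4, 16, 32)
print(rela_attention(q, k, v).shape)  # torch.Size([1, 4, 16, 32])
```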