rayhern / convert-gpt2-xl-to-onnxLinks
This will help you convert a GPT2-XL model to an optimized onnx model fp 16.
☆10Updated 5 years ago
Alternatives and similar repositories for convert-gpt2-xl-to-onnx
Users that are interested in convert-gpt2-xl-to-onnx are comparing it to the libraries listed below
Sorting:
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 4 years ago
- ☆27Updated 2 years ago
- Code for the paper "Language Models are Unsupervised Multitask Learners"☆108Updated 4 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆66Updated 3 years ago
- Music GPT-2 Implementation with Relative Positional Embedding☆77Updated 5 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated last year
- Poetry generator by gpt-2 with meter and rhyme constraints.☆53Updated 4 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆18Updated 4 years ago
- [DEPRECEATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 3 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆51Updated 5 months ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- ☆112Updated 4 years ago
- ☆30Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- Fine tune GPT-2 with your favourite authors☆71Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆64Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆145Updated 2 years ago
- Voice swapping with VQ-VAE and diffusion models☆67Updated 3 years ago
- Translating Visual Works of Art into Music (ICCVW 2019)☆34Updated 6 years ago
- Contrastive Language-Audio Pretraining☆88Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆156Updated last year
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Updated 2 years ago
- Lite Inference Toolkit (LIT) for PyTorch☆161Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 3 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆76Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- ☆22Updated 3 years ago
- Source code for the Apple reproduction☆32Updated 4 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 3 years ago