kamalkraj / minGPT-TFLinks
A minimal TF2 re-implementation of the OpenAI GPT training
☆57Updated 3 years ago
Alternatives and similar repositories for minGPT-TF
Users that are interested in minGPT-TF are comparing it to the libraries listed below
Sorting:
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- ☆60Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- Babysit your preemptible TPUs☆85Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆38Updated 4 years ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- ☆13Updated 3 years ago
- Sequence models in Numpy☆25Updated 4 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.☆32Updated 3 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- ☆28Updated 2 years ago
- ☆130Updated 2 years ago
- ☆28Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- ☆90Updated 2 years ago
- Comparing M2M and mT5 on a rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-re…☆15Updated 3 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- Hugging Face Download (Cache) Manager☆21Updated 2 years ago
- ☆21Updated 4 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆67Updated 2 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆18Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago