kamalkraj / minGPT-TFLinks
A minimal TF2 re-implementation of the OpenAI GPT training
☆57Updated 4 years ago
Alternatives and similar repositories for minGPT-TF
Users that are interested in minGPT-TF are comparing it to the libraries listed below
Sorting:
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 3 years ago
- ☆131Updated 3 years ago
- Babysit your preemptible TPUs☆86Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- ☆28Updated 2 years ago
- ☆92Updated 3 years ago
- GPT-jax based on the official huggingface library☆13Updated 4 years ago
- ☆13Updated 3 years ago
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 3 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆118Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 3 years ago
- JAX implementation of VQGAN☆91Updated 3 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆104Updated 3 years ago
- Google's Meena transformer chatbot implementation☆105Updated 4 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆66Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Sequence models in Numpy☆25Updated 5 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago