kamalkraj / minGPT-TFLinks
A minimal TF2 re-implementation of the OpenAI GPT training
☆57Updated 3 years ago
Alternatives and similar repositories for minGPT-TF
Users that are interested in minGPT-TF are comparing it to the libraries listed below
Sorting:
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆38Updated 4 years ago
- ☆60Updated 3 years ago
- Babysit your preemptible TPUs☆85Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]☆22Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- ☆67Updated 2 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU☆15Updated 4 years ago
- ☆130Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- ☆28Updated 2 years ago
- ☆18Updated 2 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- ☆90Updated 2 years ago
- Code base for internal reward models and PPO training☆25Updated last year
- Hugging Face Download (Cache) Manager☆21Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated 2 years ago
- ☆46Updated 2 years ago
- ☆13Updated 3 years ago
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.☆17Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- GPT-jax based on the official huggingface library☆13Updated 4 years ago