kamalkraj / minGPT-TFLinks
A minimal TF2 re-implementation of the OpenAI GPT training
☆57Updated 3 years ago
Alternatives and similar repositories for minGPT-TF
Users that are interested in minGPT-TF are comparing it to the libraries listed below
Sorting:
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- ☆32Updated 2 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆39Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- ☆90Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆38Updated 4 years ago
- Hugging Face Deep RL Class notes☆10Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Babysit your preemptible TPUs☆86Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- ☆28Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- NLP Examples using the 🤗 libraries☆41Updated 4 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- ☆130Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆116Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Hugging Face Download (Cache) Manager☆21Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- ☆18Updated 2 years ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆60Updated 3 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago