kamalkraj / minGPT-TF
A minimal TF2 re-implementation of OpenAI GPT training
☆57 Updated 3 years ago
Alternatives and similar repositories for minGPT-TF:
Users who are interested in minGPT-TF are comparing it to the libraries listed below.
- A library for squeakily cleaning and filtering language datasets. ☆47 Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+ ☆37 Updated 4 years ago
- Babysit your preemptible TPUs ☆85 Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆48 Updated 3 years ago
- ☆60 Updated 3 years ago
- GPT-jax based on the official huggingface library ☆13 Updated 3 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆35 Updated 2 years ago
- ☆67 Updated 2 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU ☆15 Updated 3 years ago
- NLP Examples using the 🤗 libraries ☆41 Updated 4 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- Helper scripts and notes that were used while porting various nlp models ☆46 Updated 3 years ago
- ☆13 Updated 3 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated 2 years ago
- ☆19 Updated 2 years ago
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL. ☆40 Updated this week
- ☆28 Updated last year
- Experiments with generating opensource language model assistants ☆97 Updated last year
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp ☆19 Updated 4 years ago
- ☆20 Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me ☆37 Updated 3 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆49 Updated last year
- Implementation of a simple BPE tokenizer, but in Nim ☆22 Updated last year
- Visualising Losses in Deep Neural Networks ☆16 Updated 9 months ago
- A zero-shot captcha solver. ☆16 Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI ☆57 Updated last year
- This repository contains example code to build models on TPUs ☆30 Updated 2 years ago