kamalkraj / minGPT-TFLinks

A minimal TF2 re-implementation of the OpenAI GPT training

☆57

Alternatives and similar repositories for minGPT-TF

Users that are interested in minGPT-TF are comparing it to the libraries listed below

Sorting:

CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated last year
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Updated last year
cceyda / lit-NER
TorchServe+Streamlit for easily serving your HuggingFace NER models
☆33Updated 2 years ago
wangcongcong123 / ttt
A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+
☆38Updated 4 years ago
sholtodouglas / scalingExperiments
☆60Updated 3 years ago
shawwn / tpunicorn
Babysit your preemptible TPUs
☆85Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
xrsrke / stable-diffusion-from-scratch
Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]
☆22Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated 2 years ago
labmlai / neox
Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Updated 2 years ago
huggingface / bloom-jax-inference
☆67Updated 2 years ago
Ankur3107 / dpr-tf
Dense Passage Retrieval using tensorflow-keras on TPU
☆15Updated 4 years ago
zphang / minimal-gpt-neox-20b
☆130Updated 3 years ago
pytorch-tpu / examples
This repository contains example code to build models on TPUs
☆30Updated 2 years ago
philschmid / optimum-static-quantization
☆28Updated 2 years ago
johnrobinsn / alpaca_lora_30b_4bit
☆18Updated 2 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
EleutherAI / openwebtext2
☆90Updated 2 years ago
chai-research / lmgym
Code base for internal reward models and PPO training
☆25Updated last year
thesephist / hfm
Hugging Face Download (Cache) Manager
☆21Updated 2 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 2 years ago
lordtt13 / transformers-experiments
All my experiments with the various transformers and various transformer frameworks available
☆14Updated 4 years ago
Birch-san / llama-play
Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations
☆33Updated 2 years ago
keerthanpg / DadJokeGenerator
☆46Updated 2 years ago
yvrjsharma / JAX
☆13Updated 3 years ago
radi-cho / RSTOD
Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.
☆17Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
sayakpaul / big_vision_experiments
Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.
☆22Updated 2 years ago
DOUDOU0314 / GPT-J-hf
GPT-jax based on the official huggingface library
☆13Updated 4 years ago