Norod / TrainGPT2-127M-FromScratch
A trio of Google Colab notebooks (.ipynb) for training a GPT-2 (127M) model from scratch (useful for non-English languages) using gpt-2-simple
☆15 · Updated 4 years ago
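The notebooks build on the gpt-2-simple package. A minimal sketch of that package's training workflow is below; the file name "corpus.txt" and the hyperparameter values are illustrative assumptions, not taken from the notebooks (which adapt this flow to train from scratch rather than fine-tune):

```python
# Minimal gpt-2-simple workflow (TensorFlow 1.x backend).
# "corpus.txt" is a hypothetical plain-text training file, e.g. a
# non-English corpus; step counts here are placeholders.
import gpt_2_simple as gpt2

gpt2.download_gpt2(model_name="124M")      # fetch the base GPT-2 checkpoint

sess = gpt2.start_tf_sess()
gpt2.finetune(sess,
              dataset="corpus.txt",        # your training text
              model_name="124M",
              steps=1000,                  # increase for real training runs
              save_every=200,
              print_every=50)

gpt2.generate(sess, prefix="Example prompt")
```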
Alternatives and similar repositories for TrainGPT2-127M-FromScratch:
Users interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below.
- Experimental sampler to make LLMs more creative (☆30, updated last year)
- A repository re-creating the PromptBreeder evolutionary algorithm from the DeepMind paper in Python, using LMQL as the backend (☆27, updated last year)
- One-stop shop for all things CARP (☆59, updated 2 years ago)
- Using short models to classify long texts (☆21, updated last year)
- The Next Generation Multi-Modality Superintelligence (☆70, updated 5 months ago)
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… (☆48, updated 7 months ago)
- Finetune any model on HF in less than 30 seconds (☆58, updated 3 weeks ago)
- Doohickey is a Stable Diffusion tool for technical artists who want to stay up to date with the latest developments in the field (☆39, updated 2 years ago)
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset (☆14, updated 11 months ago)
- Create soft prompts for fairseq 13B dense, GPT-J-6B, and GPT-Neo-2.7B for free in a Google Colab TPU instance (☆27, updated last year)
- A public implementation of the ReLoRA pretraining method, built on Lightning AI's PyTorch Lightning suite (☆33, updated 11 months ago)
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs) (☆26, updated last year)
- Load any CLIP model with a standardized interface (☆21, updated 9 months ago)
- Text-writing denoising diffusion (and much more) (☆30, updated last year)
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on extends the model's context limit; see the sketch after this list (☆63, updated last year)
- Image-diffusion block-merging technique applied to transformer-based language models (☆54, updated last year)
- Hidden Engrams: Long Term Memory for Transformer Model Inference (☆35, updated 3 years ago)
- GPT-jax, based on the official Hugging Face library (☆13, updated 3 years ago)
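The RoPE context-extension entry above refers to a general technique (position interpolation). Below is a minimal sketch of the idea, assuming a standard rotary-embedding setup; it is an illustration, not code from the linked repository, and `rope_angles` and all values are hypothetical:

```python
# Sketch of RoPE position interpolation: compressing position indices by a
# scale factor lets a model pretrained on short contexts attend over longer
# sequences after a brief finetune.
import torch

def rope_angles(positions: torch.Tensor, dim: int, base: float = 10000.0,
                scale: float = 1.0) -> torch.Tensor:
    """Rotary-embedding angles; scale < 1 interpolates positions
    (e.g. scale = old_ctx / new_ctx) to stretch the context window."""
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    return torch.outer(positions.float() * scale, inv_freq)

# Hypothetical numbers: pretrained on 2048 tokens, finetuned at 8192.
angles = rope_angles(torch.arange(8192), dim=64, scale=2048 / 8192)
cos, sin = angles.cos(), angles.sin()  # applied to query/key pairs as usual
```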