PraveenRaja42 / Tiny-Stories-GPT
A minimal PyTorch re-implementation of GPT (Generative Pretrained Transformer) language model training
☆12 Updated last year
Alternatives and similar repositories for Tiny-Stories-GPT:
Users who are interested in Tiny-Stories-GPT are comparing it to the repositories listed below
- LLM training in simple, raw C/CUDA ☆14 Updated 3 months ago
- A JAX-like function transformation engine, but micro: microjax ☆30 Updated 5 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆48 Updated last week
- A sample pattern for running CI tests on Modal ☆16 Updated 6 months ago
- ☆17 Updated 2 weeks ago
- QLoRA for Masked Language Modeling ☆21 Updated last year
- Alternative way of calculating self-attention ☆18 Updated 10 months ago
- ☆9 Updated 5 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single-machine microbatches, in PyTorch ☆23 Updated 2 months ago
- ☆17 Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆42 Updated last year
- ☆38 Updated 8 months ago
- ☆15 Updated 6 months ago
- ☆22 Updated last year
- Latent Large Language Models ☆17 Updated 7 months ago
- Rust bindings for CTranslate2 ☆14 Updated last year
- ☆38 Updated last month
- Implementation of a holodeck, written in PyTorch ☆17 Updated last year
- ☆19 Updated last year
- NLP with Rust for Python 🦀🐍 ☆61 Updated 10 months ago
- ☆12 Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated last year
- ☆27 Updated 8 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated last year
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated 2 weeks ago
- Deep Learning how-to's using Lance file format ☆16 Updated 6 months ago
- Andrej Karpathy's micrograd implemented in C ☆28 Updated 7 months ago
- Python tools ☆12 Updated last year
- ☆34 Updated last year