PraveenRaja42 / Tiny-Stories-GPT
A minimal PyTorch re-implementation of GPT (Generative Pretrained Transformer) language model training
☆14Updated last year
Alternatives and similar repositories for Tiny-Stories-GPT:
Users that are interested in Tiny-Stories-GPT are comparing it to the libraries listed below
- ☆38Updated 9 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 6 months ago
- LLM training in simple, raw C/CUDA☆14Updated 4 months ago
- ☆18Updated last month
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch☆40Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆49Updated last week
- ☆41Updated 2 months ago
- ☆27Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- alternative way to calculating self attention☆18Updated 11 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- QLoRA for Masked Language Modeling☆22Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆62Updated this week
- ☆9Updated 6 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- look how they massacred my boy☆63Updated 6 months ago
- Andrej Kapathy's micrograd implemented in c☆28Updated 8 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 3 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 10 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆19Updated last year
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆15Updated last year
- a WIP architecture designed to allow transformers to think in a manner without tokens☆19Updated last year
- Low-Rank Adaptation of Large Language Models clean implementation☆8Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆19Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Rust Implementation of micrograd☆51Updated 9 months ago
- ☆52Updated 5 months ago