Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).
☆356Jul 29, 2024Updated last year
Alternatives and similar repositories for hlb-gpt
Users that are interested in hlb-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,299Dec 18, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆192Jan 19, 2026Updated 3 months ago
- ☆145Mar 31, 2023Updated 3 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated 2 years ago
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- It's a baby compiler. (Lean btw.)☆16May 19, 2025Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated 10 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- NanoGPT (124M) in 90 seconds☆5,157Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- Schedule-Free Optimization in PyTorch☆2,277May 21, 2025Updated 11 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,018Aug 21, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆704Jan 26, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Jan 7, 2024Updated 2 years ago
- ☆308Jul 15, 2024Updated last year
- WIP☆95Aug 13, 2024Updated last year
- ☆124May 28, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 7 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆3,431Nov 13, 2024Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600Aug 12, 2025Updated 8 months ago
- ☆54May 20, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated 10 months ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆285Nov 24, 2025Updated 5 months ago
- ☆62Mar 4, 2022Updated 4 years ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆376Nov 15, 2025Updated 5 months ago
- ☆317Jun 21, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆15Oct 31, 2023Updated 2 years ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆280Nov 3, 2023Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Just a bunch of benchmark logs for different LLMs☆124Jul 28, 2024Updated last year
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,364Jun 13, 2024Updated last year
- Fast reinforcement learning 💨☆29Jul 15, 2025Updated 9 months ago
- ☆49Feb 23, 2025Updated last year
- The repository for the code of the UltraFastBERT paper☆518Mar 24, 2024Updated 2 years ago
- ☆13Jun 18, 2024Updated last year