A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
☆55Jan 29, 2026Updated 2 months ago
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 5 years ago
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Sep 24, 2024Updated last year
- Utillity Module for Claude Code. Dev Container, Clawdbot, queuing commands, prompt history, prompt alias, everything in one place☆21Mar 20, 2026Updated 3 weeks ago
- Lightweight offline Linux command tutor using a local LLM and ChromaDB.☆13Apr 27, 2025Updated 11 months ago
- High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generati…☆52Apr 8, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆19Jul 4, 2025Updated 9 months ago
- Flysystem Adapter for Google cloud storage using the gcloud PHP library☆14May 15, 2021Updated 4 years ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 10 months ago
- lshash for python3☆10Mar 21, 2018Updated 8 years ago
- Open source e-learning plugin for WordPress☆14Oct 27, 2019Updated 6 years ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)☆49Mar 22, 2026Updated 3 weeks ago
- ☆14May 2, 2024Updated last year
- bodyweight workout plan from https://www.reddit.com/r/bodyweightfitness/wiki/kb/recommended_routine☆17Jan 16, 2018Updated 8 years ago
- A usable replacement for PHPUnit withConsecutive after it got deprecated without a replacement.☆18Apr 20, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A WordPress plugin to generate Terms of Service and Privacy Policy.☆17Jan 23, 2018Updated 8 years ago
- Responsive HTML template for exporting Zim Desktop Wiki notebooks☆19Dec 13, 2017Updated 8 years ago
- General fixed point mapping acceleration and optimization in Julia☆17Sep 24, 2025Updated 6 months ago
- A quick implementation of diffusion language models.☆48Oct 11, 2025Updated 6 months ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- An agent that can run everywhere - even in your watch!☆30Apr 8, 2026Updated last week
- ☆83Feb 28, 2025Updated last year
- quaprogIP solver for Non-Convex quadratic programs☆11Jun 28, 2019Updated 6 years ago
- A fast, interactive tool to visualize how different gradient descent algorithms (like vanilla gradient Descent, Momentum, RMSprop, Adam, …☆20May 12, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- obliquetree is an advanced decision tree implementation featuring oblique and axis-aligned splits, optimized performance.☆24Mar 26, 2026Updated 3 weeks ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- Collection of GAN models with Flux☆23Jul 28, 2023Updated 2 years ago
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- Profile page - Build with React and styled-components