A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
☆63Jan 29, 2026Updated 3 months ago
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below.
- A concise list of CLI coding tools similar to Claude Code☆33Apr 13, 2026Updated 3 weeks ago
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Sep 24, 2024Updated last year
- Utility Module for Claude Code. Dev Container, Clawdbot, queuing commands, prompt history, prompt alias, everything in one place☆22Apr 24, 2026Updated last week
- Lightweight offline Linux command tutor using a local LLM and ChromaDB.☆13Apr 27, 2025Updated last year
- Using spirits to communicate over long distances.☆17Jun 26, 2019Updated 6 years ago
- ☆19Jul 4, 2025Updated 10 months ago
- This repository contains a deep learning-based approach for improving A* search efficiency on grid graphs. By learning instance-dependent…☆15Oct 2, 2024Updated last year
- Flysystem Adapter for Google cloud storage using the gcloud PHP library☆14May 15, 2021Updated 4 years ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 11 months ago
- Open source e-learning plugin for WordPress☆14Oct 27, 2019Updated 6 years ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)☆50Apr 15, 2026Updated 3 weeks ago
- Fast evaluation of multivariate polynomials☆17Jun 26, 2023Updated 2 years ago
- FBconverter is a database converter for Firebird OpenSource RDBMS.☆16Jul 13, 2017Updated 8 years ago
- bodyweight workout plan from https://www.reddit.com/r/bodyweightfitness/wiki/kb/recommended_routine☆17Jan 16, 2018Updated 8 years ago
- A WordPress plugin to generate Terms of Service and Privacy Policy.☆17Jan 23, 2018Updated 8 years ago
- Responsive HTML template for exporting Zim Desktop Wiki notebooks☆19Dec 13, 2017Updated 8 years ago
- A quick implementation of diffusion language models.☆48Oct 11, 2025Updated 6 months ago
- GraphLand: Evaluating Graph Machine Learning Models on Diverse Industrial Data☆35Apr 8, 2026Updated 3 weeks ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆83Feb 28, 2025Updated last year
- Collection of optimization test functions and some useful methods for working with them☆16Apr 4, 2025Updated last year
- obliquetree is an advanced decision tree implementation featuring oblique and axis-aligned splits, optimized performance.☆24Mar 26, 2026Updated last month
- This source code is a MATLAB implementation of a nonlinear unsharp masking method, published in the proceedings of ICEIC 2020, Barcelona, …☆19Sep 4, 2021Updated 4 years ago
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- ☆15Jan 22, 2022Updated 4 years ago
- ICP implementation in Rust☆15Jun 27, 2024Updated last year
- Profile page - Built with React and styled-components☆15Oct 9, 2023Updated 2 years ago
- ☆31Aug 27, 2024Updated last year
- A simple example of how to bind C++ code in Python☆14Nov 13, 2020Updated 5 years ago
- A fast, interactive tool to visualize how different gradient descent algorithms (like vanilla gradient descent, Momentum, RMSprop, Adam, …☆20May 12, 2025Updated 11 months ago
- ☆13Mar 1, 2023Updated 3 years ago
- ☆23Sep 27, 2024Updated last year
- A collection of best practices and procedures for using Claude Code☆82Nov 9, 2025Updated 5 months ago
- commonly used test images☆26Oct 25, 2024Updated last year
- ☆24Apr 18, 2019Updated 7 years ago
- Parakeet, a tiny language model by Byte Breeze Studios☆29Oct 19, 2024Updated last year
- Solve the advection diffusion equations looped into an optimization problem with JAX/autodiff☆14May 8, 2025Updated 11 months ago
- A general branch and bound framework☆34Dec 28, 2025Updated 4 months ago
- Simple setup for horizontally scaling php-fpm docker containers.☆22Oct 13, 2015Updated 10 years ago