A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
☆66Jan 29, 2026Updated 4 months ago
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Dec 21, 2024Updated last year
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 6 years ago
- A concise list of CLI coding tools similar to Claude Code☆39Apr 13, 2026Updated last month
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Sep 24, 2024Updated last year
- Utillity Module for Claude Code. Dev Container, Clawdbot, queuing commands, prompt history, prompt alias, everything in one place☆23Apr 24, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Lightweight offline Linux command tutor using a local LLM and ChromaDB.☆13Apr 27, 2025Updated last year
- Using spirits to communicate over long distances.☆17Jun 26, 2019Updated 6 years ago
- ☆19Jul 4, 2025Updated 10 months ago
- Flysystem Adapter for Google cloud storage using the gcloud PHP library☆14May 15, 2021Updated 5 years ago
- lshash for python3☆10Mar 21, 2018Updated 8 years ago
- Open source e-learning plugin for WordPress☆14Oct 27, 2019Updated 6 years ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)☆50Apr 15, 2026Updated last month
- Fast evaluation of multivariate polynomials☆17Jun 26, 2023Updated 2 years ago
- ☆14May 2, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- bodyweight workout plan from https://www.reddit.com/r/bodyweightfitness/wiki/kb/recommended_routine☆17Jan 16, 2018Updated 8 years ago
- A usable replacement for PHPUnit withConsecutive after it got deprecated without a replacement.☆18Apr 20, 2025Updated last year
- A WordPress plugin to generate Terms of Service and Privacy Policy.☆17Jan 23, 2018Updated 8 years ago
- Responsive HTML template for exporting Zim Desktop Wiki notebooks☆20Dec 13, 2017Updated 8 years ago
- A quick implementation of diffusion language models.☆48Oct 11, 2025Updated 7 months ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- A component orchestration engine☆27Nov 28, 2023Updated 2 years ago
- ☆82Feb 28, 2025Updated last year
- Student version of Mini-SLAM.☆10Mar 16, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Collection of optimization test functions and some useful methods for working with them☆17Apr 4, 2025Updated last year
- bash-based certificate authority, with CA-cert, signing-cert, and revocation support.☆10Oct 8, 2021Updated 4 years ago
- obliquetree is an advanced decision tree implementation featuring oblique and axis-aligned splits, optimized performance.☆24May 5, 2026Updated 3 weeks ago
- This source code is a MATLAB implementation of a nonlinear unsharp masking method, published in the proceeding of ICEIC 2020, Barcelona, …☆19Sep 4, 2021Updated 4 years ago
- A fast, interactive tool to visualize how different gradient descent algorithms (like vanilla gradient Descent, Momentum, RMSprop, Adam, …☆20May 12, 2025Updated last year
- A simple example of how to bind C++ code in Python☆14Nov 13, 2020Updated 5 years ago
- Pythonic Perambulations website. Source at http://github.com/jakevdp/jakevdp.github.io-source☆40Oct 1, 2020Updated 5 years ago
- Module implements CRUD with static pages with uses Imperavi Redactor.☆15Nov 13, 2020Updated 5 years ago
- ☆14Mar 1, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆23Sep 27, 2024Updated last year
- ZIM-wiki export templates☆21May 19, 2018Updated 8 years ago
- Docker images for GStreamer☆15May 29, 2018Updated 8 years ago
- commonly used test images☆26Oct 25, 2024Updated last year
- ECDLP solver with hardware acceleration of GPU.☆18Mar 5, 2025Updated last year
- ☆24Apr 18, 2019Updated 7 years ago
- Example of using TBB parallel pipeline with OpenCV☆16Mar 27, 2017Updated 9 years ago