A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
☆53Jan 29, 2026Updated last month
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 5 years ago
- Lightweight offline Linux command tutor using a local LLM and ChromaDB.☆13Apr 27, 2025Updated 11 months ago
- ☆19Jul 4, 2025Updated 8 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 10 months ago
- lshash for python3☆10Mar 21, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Open source e-learning plugin for WordPress☆14Oct 27, 2019Updated 6 years ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)☆47Updated this week
- Fast evaluation of multivariate polynomials☆17Jun 26, 2023Updated 2 years ago
- Conductor is a Gemini CLI extension that allows you to specify, plan, and implement software features.☆48Mar 19, 2026Updated last week
- FBconverter is a database converter for Firebird OpenSource RDBMS.☆16Jul 13, 2017Updated 8 years ago
- bodyweight workout plan from https://www.reddit.com/r/bodyweightfitness/wiki/kb/recommended_routine☆17Jan 16, 2018Updated 8 years ago
- Plugin to hide lines with or without appropriate text. Or simply select several lines and hide them.☆15May 21, 2022Updated 3 years ago
- Responsive HTML template for exporting Zim Desktop Wiki notebooks☆19Dec 13, 2017Updated 8 years ago
- BZScene Multimedia 2D, 3D, Audio library for Lazarus and FPC / BZScene Bibliothèque multimédia 2D, 3D, Audio pour Lazarus et FPC☆20Aug 11, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- General fixed point mapping acceleration and optimization in Julia☆17Sep 24, 2025Updated 6 months ago
- A quick implementation of diffusion language models.☆48Oct 11, 2025Updated 5 months ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆83Feb 28, 2025Updated last year
- Student version of Mini-SLAM.☆10Mar 16, 2024Updated 2 years ago
- Collection of optimization test functions and some useful methods for working with them☆16Apr 4, 2025Updated 11 months ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Oct 24, 2024Updated last year
- obliquetree is an advanced decision tree implementation featuring oblique and axis-aligned splits, optimized performance.☆23Mar 14, 2026Updated last week
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Collection of GAN models with Flux☆23Jul 28, 2023Updated 2 years ago
- ICP implementation in Rust☆15Jun 27, 2024Updated last year
- ☆31Aug 27, 2024Updated last year
- ☆13Mar 1, 2023Updated 3 years ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆34Feb 11, 2026Updated last month
- commonly used test images☆26Oct 25, 2024Updated last year
- ECDLP solver with hardware acceleration of GPU.☆18Mar 5, 2025Updated last year
- Example of using TBB parallel pipeline with OpenCV☆16Mar 27, 2017Updated 9 years ago
- A general branch and bound framework☆34Dec 28, 2025Updated 2 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Mar 8, 2025Updated last year
- llm-eval-simple is a simple LLM evaluation framework with intermediate actions and prompt pattern selection☆59Feb 28, 2026Updated 3 weeks ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- Generative Models with trainable conditional distributions in Julia!☆31Sep 21, 2022Updated 3 years ago
- Official implementation for "Few-shot Image Generation with Mixup-based Distance Learning" [ECCV 2022]☆26Nov 14, 2022Updated 3 years ago
- Triton backend for managing the model state tensors automatically in sequence batcher☆16Feb 12, 2024Updated 2 years ago
- A Very Simple Vector Database☆15May 1, 2023Updated 2 years ago