Ashx098/Mini-LLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ashx098/Mini-LLM)

Ashx098 / Mini-LLM

A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.

☆53

Alternatives and similar repositories for Mini-LLM

Users that are interested in Mini-LLM are comparing it to the libraries listed below

Sorting:

MiguelAngelCalveraUnizar / Mini-SLAM_student
View on GitHub
Student version of Mini-SLAM.
☆10Mar 16, 2024Updated last year
wzes / lshash
View on GitHub
lshash for python3
☆10Mar 21, 2018Updated 7 years ago
mark-borg / python_cpp_messaging
View on GitHub
Messaging between C++ and Python using RabbitMQ
☆11Aug 17, 2018Updated 7 years ago
vianarafael / rafterai
View on GitHub
Lightweight offline Linux command tutor using a local LLM and ChromaDB.
☆13Apr 27, 2025Updated 10 months ago
insight-platform / SavantPyTorchComparison
View on GitHub
Compare Savant and PyTorch performance
☆13Feb 9, 2024Updated 2 years ago
edbeeching / learning_to_plan
View on GitHub
Code for paper "Learning to Plan with Uncertain Topological Maps"
☆10Aug 28, 2020Updated 5 years ago
samas69420 / transformino
View on GitHub
☆19Jul 4, 2025Updated 8 months ago
pmocz / advectiondiffusion-jax
View on GitHub
Solve the advection diffusion equations looped into an optimization problem with JAX/autodiff
☆14May 8, 2025Updated 9 months ago
vincent-vdb / medium_posts
View on GitHub
Compilation of codes for medium posts or drafts
☆15May 18, 2025Updated 9 months ago
sagi-z / OpenCVPipeline
View on GitHub
Example of using TBB parallel pipeline with OpenCV
☆16Mar 27, 2017Updated 8 years ago
maxmcd / gstreamer-docker
View on GitHub
Docker images for GStreamer
☆15May 29, 2018Updated 7 years ago
moorissa / lidar-obstacle-detector
View on GitHub
Lidar Obstacle Detector
☆24Jun 3, 2025Updated 9 months ago
darconeous / SimpleCardAuth
View on GitHub
A simple but secure implementation of a PKI-based Physical Access Control System
☆16Sep 20, 2016Updated 9 years ago
Dicklesworthstone / llm-tournament
View on GitHub
Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)
☆47Updated this week
cyrusbehr / sdk_design
View on GitHub
How to Design a Language Agnostic SDK for Cross Platform Deployment and Maximum Extensibility: A Tutorial
☆20Nov 21, 2022Updated 3 years ago
triton-inference-server / stateful_backend
View on GitHub
Triton backend for managing the model state tensors automatically in sequence batcher
☆17Feb 12, 2024Updated 2 years ago
tier4 / icp_rust
View on GitHub
ICP implementation in Rust
☆15Jun 27, 2024Updated last year
Thrasher-Software / sigil
View on GitHub
A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.
☆17May 21, 2025Updated 9 months ago
mcahny / rovit
View on GitHub
RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆17Aug 24, 2023Updated 2 years ago
duzgunilaslan / Deploy-ML-Model-FastAPI-MLFlow-MINIO-MySQL
View on GitHub
This repository about how to deploy machine learning model end serving with FastAPI and using MLFlow-MINIO
☆18Jun 11, 2023Updated 2 years ago
deep-diver / Continuous-Adaptation-with-VertexAI-AutoML-Pipeline
View on GitHub
☆23Apr 19, 2022Updated 3 years ago
KartDriver / mira_converse
View on GitHub
☆83Feb 28, 2025Updated last year
ibm-granite / granite-vision-models
View on GitHub
☆32Feb 9, 2026Updated 3 weeks ago
thanhlnbka / yolov7-triton-deepstream
View on GitHub
☆24Oct 10, 2022Updated 3 years ago
nmaggioni / Dumpster
View on GitHub
A lightweight, self-hosted and API-based file upload server supporting YubiKey OTP authentication.
☆23Jul 17, 2020Updated 5 years ago
britram / trilateration
View on GitHub
Tools for quantifying error in RTT-based trilateration/geolocation techniques
☆20Nov 10, 2018Updated 7 years ago
nadrino / simple-cpp-logger
View on GitHub
A simple header-only library written in C++ which provide logger features
☆23Jul 4, 2025Updated 8 months ago
yanlai00 / bridge_data_imitation_learning
View on GitHub
☆22Oct 4, 2021Updated 4 years ago
Detrol / quorum-cli
View on GitHub
Multi-agent AI discussion CLI for structured debates between LLMs
☆72Jan 1, 2026Updated 2 months ago
ChuChuIgbokwe / SLAM-from-scratch
View on GitHub
☆22Feb 23, 2017Updated 9 years ago
psdwizzard / MeetingBuddy
View on GitHub
☆31Mar 26, 2025Updated 11 months ago
vatsalsaglani / bert4rec
View on GitHub
☆27Aug 4, 2022Updated 3 years ago
TheMoskowitz / tensorflow-surgery
View on GitHub
A guide on how to edit the nodes in trained tensorflow models
☆27May 22, 2017Updated 8 years ago
yangliu28 / minimal_robot_gripper
View on GitHub
Simple robot gripper demonstration in ROS (homework 5 for ROS class)
☆25Jan 19, 2018Updated 8 years ago
stringandstickytape / MaxsAiStudio
View on GitHub
A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.
☆35Feb 11, 2026Updated 3 weeks ago
YH-Wu / Triton-Inference-Server-on-Kubernetes
View on GitHub
☆33Jul 7, 2022Updated 3 years ago
zhouyuchong / gst-nvinfer-custom
View on GitHub
Custom gst-nvinfer for alignment in Deepstream
☆31Nov 22, 2024Updated last year
jonasteuwen / chaos-challenge
View on GitHub
Scripts to work with the chaos challenge
☆26Mar 4, 2019Updated 7 years ago
Mahrkeenerh / lfind
View on GitHub
A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.
☆27Mar 8, 2025Updated 11 months ago