Ashx098 / Mini-LLMView external linksLinks
A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
☆49Jan 29, 2026Updated 2 weeks ago
Alternatives and similar repositories for Mini-LLM
Users that are interested in Mini-LLM are comparing it to the libraries listed below
Sorting:
- Student version of Mini-SLAM.☆10Mar 16, 2024Updated last year
- ☆10Dec 8, 2022Updated 3 years ago
- An agent that can run everywhere - even in your watch!☆28Feb 2, 2026Updated last week
- This repository contains a deep learning-based approach for improving A* search efficiency on grid graphs. By learning instance-dependent…☆13Oct 2, 2024Updated last year
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 5 years ago
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Sep 24, 2024Updated last year
- A simple example of how to bind C++ code in Python☆14Nov 13, 2020Updated 5 years ago
- An artificial life experiment.☆12Jul 30, 2020Updated 5 years ago
- Messaging between C++ and Python using RabbitMQ☆11Aug 17, 2018Updated 7 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Code of paper "A Video Dataset for Falling Object Detection around Buildings" https://arxiv.org/abs/2408.05750☆17Jul 10, 2025Updated 7 months ago
- Lightweight offline Linux command tutor using a local LLM and ChromaDB.☆13Apr 27, 2025Updated 9 months ago
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- ☆18Jul 4, 2025Updated 7 months ago
- Code for paper "Learning to Plan with Uncertain Topological Maps"☆10Aug 28, 2020Updated 5 years ago
- My own implementation of Redis☆17Jun 30, 2025Updated 7 months ago
- Compilation of codes for medium posts or drafts☆14May 18, 2025Updated 8 months ago
- quaprogIP solver for Non-Convex quadratic programs☆11Jun 28, 2019Updated 6 years ago
- FBconverter is a database converter for Firebird OpenSource RDBMS.☆16Jul 13, 2017Updated 8 years ago
- Using spirits to communicate over long distances.☆17Jun 26, 2019Updated 6 years ago
- How to Design a Language Agnostic SDK for Cross Platform Deployment and Maximum Extensibility: A Tutorial☆20Nov 21, 2022Updated 3 years ago
- Triton backend for managing the model state tensors automatically in sequence batcher☆17Feb 12, 2024Updated 2 years ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 8 months ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- udev rules and helper script to bring up Virtual Function interfaces on Intel NIC that are using ixgbe Linux kernel module☆17Oct 20, 2020Updated 5 years ago
- YOLO v5 Object Detection on Triton Inference Server☆16Mar 30, 2023Updated 2 years ago
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Aug 25, 2021Updated 4 years ago
- ☆24Oct 10, 2022Updated 3 years ago
- ☆30Aug 27, 2024Updated last year
- ☆23Sep 27, 2024Updated last year
- Multi-agent AI discussion CLI for structured debates between LLMs☆69Jan 1, 2026Updated last month
- A PyTorch implementation of RetinaNet with `ResNet` backbone☆21Aug 9, 2021Updated 4 years ago
- ☆31Mar 26, 2025Updated 10 months ago
- ☆22Feb 23, 2017Updated 8 years ago
- ☆27Jun 11, 2025Updated 8 months ago
- llm-eval-simple is a simple LLM evaluation framework with intermediate actions and prompt pattern selection☆57Dec 24, 2025Updated last month
- Scripts to work with the chaos challenge☆26Mar 4, 2019Updated 6 years ago
- Script for building FreeBSD Edgerouter Lite disk images☆31Aug 19, 2019Updated 6 years ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Mar 8, 2025Updated 11 months ago