skyzh / tiny-llm

(🚧 WIP) A course on LLM inference serving on Apple Silicon for systems engineers.

☆1,871

Alternatives and similar repositories for tiny-llm

Users who are interested in tiny-llm are comparing it to the libraries listed below.

Anemll / Anemll
Artificial Neural Engine Machine Learning Library
☆914 Updated this week
therealoliver / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆573 Updated 2 months ago
ml-explore / mlx-lm
Run LLMs with MLX
☆667 Updated this week
mohsen1 / llm-debugger-vscode-extension
VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs
☆331 Updated 3 months ago
DonTizi / rlama
A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…
☆1,006 Updated last month
lmnr-ai / index
The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web
☆2,110 Updated last week
Om-Alve / smolGPT
☆1,353 Updated 3 months ago
BrowserMCP / mcp
Browser MCP is a Model Context Provider (MCP) server that allows AI applications to control your browser
☆1,475 Updated 3 weeks ago
ezyang / codemcp
Coding assistant MCP for Claude Desktop
☆1,289 Updated last week
Blaizzy / mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
☆1,258 Updated this week
perone / vectorvfs
Your filesystem as a vector database
☆339 Updated 2 weeks ago
morphik-org / morphik-core
Open source multi-modal RAG for building AI apps over private knowledge.
☆2,266 Updated this week
huggingface / smollm
Everything about the SmolLM2 and SmolVLM family of models
☆2,361 Updated last month
bodo-run / yek
A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption
☆2,059 Updated this week
codingmoh / open-codex
Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.
☆491 Updated last week
trycua / cua
c/ua is the Docker Container for Computer-Use AI Agents.
☆6,603 Updated this week
Blaizzy / mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…
☆2,124 Updated this week
HelixDB / helix-db
HelixDB is a powerful, open-source, graph-vector database built in Rust for intelligent data storage for RAG and AI.
☆1,257 Updated this week
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆894 Updated last week
KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆2,139 Updated last week
apple / ml-fastvlm
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
☆1,990 Updated last week
samuel-vitorino / lm.rs
Minimal LLM inference in Rust
☆983 Updated 6 months ago
pingcap / autoflow
pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…
☆2,556 Updated this week
rowboatlabs / rowboat
AI-powered multi-agent builder
☆2,787 Updated this week
PhialsBasement / Chain-of-Recursive-Thoughts
I made my AI think harder by making it argue with itself repeatedly. It works stupidly well.
☆2,059 Updated 2 weeks ago
OpenPipe / ART
Agent Reinforcement Trainer for training multi-turn agents using GRPO
☆560 Updated this week
robertpiosik / CodeWebChat
Non-agentic 100% free & open source coding tool for AI-assisted programming.
☆816 Updated this week
ses4255 / Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆637 Updated 2 weeks ago
MinishLab / model2vec
Fast State-of-the-Art Static Embeddings
☆1,615 Updated this week
opencode-ai / opencode
☆2,465 Updated this week