skyzh / tiny-llm
(π§ WIP) a course of LLM inference serving on Apple Silicon for systems engineers.
β1,871Updated last week
Alternatives and similar repositories for tiny-llm
Users that are interested in tiny-llm are comparing it to the libraries listed below
Sorting:
- Artificial Neural Engine Machine Learning Libraryβ914Updated this week
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.β573Updated 2 months ago
- Run LLMs with MLXβ667Updated this week
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programsβ331Updated 3 months ago
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems fβ¦β1,006Updated last month
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,110Updated last week
- β1,353Updated 3 months ago
- Browser MCP is a Model Context Provider (MCP) server that allows AI applications to control your browserβ1,475Updated 3 weeks ago
- Coding assistant MCP for Claude Desktopβ1,289Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β1,258Updated this week
- Your filesystem as a vector databaseβ339Updated 2 weeks ago
- Open source multi-modal RAG for building AI apps over private knowledge.β2,266Updated this week
- Everything about the SmolLM2 and SmolVLM family of modelsβ2,361Updated last month
- A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumptionβ2,059Updated this week
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.β491Updated last week
- c/ua is the Docker Container for Computer-Use AI Agents.β6,603Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speecβ¦β2,124Updated this week
- HelixDB is a powerful, open-source, graph-vector database built in Rust for intelligent data storage for RAG and AI.β1,257Updated this week
- Textbook on reinforcement learning from human feedbackβ894Updated last week
- Have a natural, spoken conversation with AI!β2,139Updated last week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025β1,990Updated last week
- Minimal LLM inference in Rustβ983Updated 6 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidβ¦β2,556Updated this week
- AI-powered multi-agent builderβ2,787Updated this week
- I made my AI think harder by making it argue with itself repeatedly. It works stupidly well.β2,059Updated 2 weeks ago
- Agent Reinforcement Trainer for training multi-turn agents using GRPOβ560Updated this week
- Non-agentic 100% free & open source coding tool for AI-assisted programming.β816Updated this week
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β637Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddingsβ1,615Updated this week
- β2,465Updated this week