jasonacox / TinyLLM
Set up and run a local LLM and chatbot using consumer-grade hardware.
☆292 Updated this week
Alternatives and similar repositories for TinyLLM
Users who are interested in TinyLLM are comparing it to the libraries listed below.
- Locally running LLM with internet access ☆97 Updated 4 months ago
- Self-host LLMs with vLLM and BentoML ☆153 Updated this week
- Unsloth Studio ☆113 Updated 6 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆375 Updated this week
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs. ☆96 Updated last year
- API Server for Transformer Lab ☆79 Updated this week
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning! ☆122 Updated 11 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆130 Updated 2 years ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS. ☆159 Updated 5 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu… ☆75 Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code. ☆57 Updated 2 years ago
- ☆163 Updated 8 months ago
- One click templates for inferencing Language Models ☆217 Updated 2 months ago
- Local first human friendly agents toolkit for the browser and Nodejs ☆44 Updated last week
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit ☆58 Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆188 Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆601 Updated 8 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com. ☆119 Updated last year
- A proxy server for multiple ollama instances with Key security ☆515 Updated 2 weeks ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆102 Updated last year
- ☆206 Updated last month
- A package for visualising Chroma vector collections in 3D ☆108 Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆278 Updated 4 months ago
- ☆162 Updated 2 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp ☆160 Updated 6 months ago
- An OpenAI API-compatible REST server for llama. ☆208 Updated 8 months ago
- A fast batching API to serve LLM models ☆188 Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆118 Updated last year
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints. ☆88 Updated last month
- Python package wrapping llama.cpp for on-device LLM inference ☆92 Updated 2 weeks ago