jasonacox / TinyLLM
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆308Updated last month
Alternatives and similar repositories for TinyLLM
Users who are interested in TinyLLM are comparing it to the libraries listed below.
- Self-host LLMs with vLLM and BentoML☆163Updated last month
- Locally running LLM with internet access☆97Updated 6 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆167Updated 8 months ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 11 months ago
- API Server for Transformer Lab☆82Updated last month
- Unsloth Studio☆122Updated 9 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆274Updated this week
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆77Updated last year
- LLM using long-term memory through vector database☆53Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆390Updated 2 weeks ago
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints.☆89Updated last week
- A package for visualising Chroma vector collections in 3D☆110Updated last year
- QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.☆315Updated 4 months ago
- Local first human friendly agents toolkit for the browser and Nodejs☆45Updated this week
- Corrective RAG demo powered by Ollama☆109Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆170Updated 3 weeks ago
- A fast batching API to serve LLM models☆189Updated last year
- A simple, intuitive toolkit for quickly implementing LLM powered applications.☆272Updated last year
- ☆210Updated this week
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆103Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆322Updated last week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆118Updated last year
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆187Updated last year
- Nginx proxy server in a Docker container to Authenticate & Proxy requests to Ollama from Public Internet via Cloudflare Tunnel☆155Updated 4 months ago
- Build an AI Agent from Libraries of Functions -- My most advanced agent framework☆144Updated 7 months ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆59Updated last year
- One click templates for inferencing Language Models☆223Updated last month
- ☆69Updated last year