jasonacox / TinyLLM
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆298Updated last week
Alternatives and similar repositories for TinyLLM
Users who are interested in TinyLLM are comparing it to the libraries listed below.
- Self-host LLMs with vLLM and BentoML☆157Updated this week
- On-device LLM Inference Powered by X-Bit Quantization☆273Updated last week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆385Updated this week
- Locally running LLM with internet access☆97Updated 4 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆609Updated 9 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆76Updated last year
- Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)☆245Updated last year
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆285Updated last year
- Corrective RAG demo powered by Ollama☆108Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 9 months ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆161Updated 6 months ago
- Plug Whisper audio transcription into a local Ollama server and output TTS audio responses☆360Updated last month
- Own your AI, search the web with it🌐😎☆92Updated 10 months ago
- A collection of all available inference solutions for the LLMs☆92Updated 8 months ago
- ☆268Updated 5 months ago
- Local llamaindex RAG to assist researchers quickly navigate research papers☆122Updated 6 months ago
- 🌉 How to deploy an open-source code LLM for your dev team☆108Updated 2 years ago
- ☆170Updated last year
- LLM using long-term memory through vector database☆53Updated last year
- One click templates for inferencing Language Models☆219Updated last week
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆129Updated 2 years ago
- Deploy a light and full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆44Updated last year
- Unsloth Studio☆118Updated 7 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆621Updated last year
- Tutorial for building LLM router☆236Updated last year
- A proxy server for multiple ollama instances with Key security☆531Updated 2 weeks ago
- ☆207Updated last year
- DSPy in action with open-source LLMs.☆98Updated last year