jasonacox / TinyLLMLinks
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆312Updated 2 months ago
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below
Sorting:
- On-device LLM Inference Powered by X-Bit Quantization☆278Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML☆168Updated 3 weeks ago
- Locally running LLM with internet access☆97Updated 7 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆467Updated last year
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆79Updated last year
- API Server for Transformer Lab☆83Updated 2 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- ☆185Updated 2 years ago
- Granite 3.1 Language Models☆137Updated 7 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆188Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆401Updated 2 weeks ago
- LLM using long-term memory through vector database☆52Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- ☆270Updated 7 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆105Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆60Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆331Updated 3 weeks ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆170Updated 9 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆615Updated 11 months ago
- QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.☆318Updated 5 months ago
- ☆109Updated 5 months ago
- One click templates for inferencing Language Models☆228Updated 2 months ago
- 🌉 How to deploy an open-source code LLM for your dev team☆112Updated 2 years ago
- A simple to use Ollama autocompletion engine with options exposed and streaming functionality☆144Updated 10 months ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆123Updated last year
- Corrective RAG demo powerd by Ollama☆110Updated last year
- A fast batching API to serve LLM models☆189Updated last year