jasonacox / TinyLLMLinks
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆310Updated 2 months ago
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below
Sorting:
- On-device LLM Inference Powered by X-Bit Quantization☆278Updated last week
- Locally running LLM with internet access☆97Updated 7 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆78Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆326Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML☆168Updated last week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆393Updated last week
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- API Server for Transformer Lab☆83Updated 2 months ago
- Tutorial for building LLM router☆242Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆103Updated 5 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆629Updated last year
- A proxy server for multiple ollama instances with Key security☆575Updated last week
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆60Updated last year
- One click templates for inferencing Language Models☆227Updated 2 months ago
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆285Updated last year
- ☆269Updated 7 months ago
- ☆109Updated 5 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆172Updated last month
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆438Updated 2 months ago
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆246Updated 2 years ago
- Not Diamond Python SDK☆90Updated last month
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆100Updated 3 months ago
- ☆209Updated 3 weeks ago
- A fast batching API to serve LLM models☆188Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆612Updated 11 months ago
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆90Updated last year
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆467Updated last year
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆188Updated last year
- ☆69Updated last year