jasonacox / TinyLLM
Set up and run a local LLM and chatbot using consumer-grade hardware.
☆281 · Updated last month
Alternatives and similar repositories for TinyLLM
Users who are interested in TinyLLM are comparing it to the repositories listed below.
- On-device LLM Inference Powered by X-Bit Quantization ☆267 · Updated last month
- Unsloth Studio ☆106 · Updated 5 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging… ☆183 · Updated 10 months ago
- Self-host LLMs with vLLM and BentoML ☆142 · Updated this week
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu… ☆75 · Updated last year
- Granite 3.1 Language Models ☆120 · Updated 2 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS. ☆152 · Updated 3 months ago
- Local llamaindex RAG to help researchers quickly navigate research papers ☆115 · Updated 3 months ago
- Locally running LLM with internet access ☆96 · Updated 2 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation ☆104 · Updated 8 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆365 · Updated last week
- An endpoint server for efficiently serving quantized open-source LLMs for code. ☆56 · Updated last year
- Local PDF Chat Application with Mistral 7B LLM, Langchain, Ollama, and Streamlit ☆150 · Updated last year
- A proxy server for multiple ollama instances with Key security ☆485 · Updated last month
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint. ☆183 · Updated 7 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆403 · Updated 4 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs), allowing users to chat with LLM … ☆587 · Updated 6 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆129 · Updated 2 years ago
- One-click templates for inferencing Language Models ☆214 · Updated last month
- A simple experiment on letting two local LLMs have a conversation about anything! ☆110 · Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp ☆158 · Updated 4 months ago
- LLM Benchmark for Throughput via Ollama (Local LLMs) ☆290 · Updated 3 weeks ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit ☆58 · Updated last year
- A memory framework for Large Language Models and Agents. ☆183 · Updated 8 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆608 · Updated 10 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆259 · Updated 6 months ago
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints. ☆90 · Updated this week
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge! ☆440 · Updated last year
- Tutorial for building an LLM router ☆226 · Updated last year
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized in PDFs and HTML files. Extensive support of tabular… ☆73 · Updated last week