jasonacox / TinyLLMLinks
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆276Updated last week
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below
Sorting:
- Locally running LLM with internet access☆96Updated last month
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆72Updated 11 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆56Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆349Updated this week
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- Running local Language Language Models (LLM) to perform Retrieval-Augmented Generation (RAG)☆242Updated 6 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆261Updated last week
- Self-host LLMs with vLLM and BentoML☆140Updated last week
- function calling-based LLM agents☆288Updated 10 months ago
- Unsloth Studio☆99Updated 4 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆183Updated 9 months ago
- Corrective RAG demo powerd by Ollama☆104Updated last year
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆604Updated 9 months ago
- Tutorial for building LLM router☆221Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 6 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆429Updated 11 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- Granite 3.1 Language Models☆117Updated last month
- Route LLM requests to the best model for the task at hand.☆90Updated last month
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆584Updated 5 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 9 months ago
- Compare open-source local LLM inference projects by their metrics to assess popularity and activeness.☆625Updated 3 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆391Updated 3 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆177Updated 11 months ago
- A collection of all available inference solutions for the LLMs☆91Updated 5 months ago
- ☆160Updated 6 months ago
- 🚀 The LLM Automatic Computer Framework: L2MAC☆130Updated 7 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆184Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆150Updated 3 months ago
- ☆261Updated last month