jasonacox / TinyLLMLinks
Setup and run a local LLM and Chatbot using consumer grade hardware.
☆269Updated 3 weeks ago
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below
Sorting:
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆69Updated 11 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆256Updated last month
- Locally running LLM with internet access☆96Updated 2 weeks ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- Self-host LLMs with vLLM and BentoML☆134Updated 2 weeks ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 5 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆333Updated 3 weeks ago
- Unsloth Studio☆93Updated 3 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- Using Langroid's Multi-Agent Framework to Build LLM Apps☆142Updated 2 weeks ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- Implementing Ollama and Agents to create a blogging bot☆127Updated 5 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆98Updated this week
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆181Updated 9 months ago
- ☆156Updated last year
- Local first human friendly agents toolkit for the browser and Nodejs☆42Updated last month
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 8 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆146Updated 2 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆105Updated 7 months ago
- Corrective RAG demo powerd by Ollama☆103Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 11 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆154Updated 2 months ago
- Distribute and run llamafile/LLMs with a single docker image.☆73Updated 2 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated last year
- ☆131Updated 2 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆55Updated last year
- An OpenAI-like LLaMA inference API☆112Updated last year
- Own your AI, search the web with it🌐😎☆86Updated 6 months ago
- Build your own ChatPDF and run it locally☆383Updated 9 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆116Updated last year