Python package wrapping llama.cpp for on-device LLM inference
☆101Oct 12, 2025Updated 4 months ago
Alternatives and similar repositories for easy-llama
Users that are interested in easy-llama are comparing it to the libraries listed below
Sorting:
- Simple Summarizer Tool using Llama 3 8b.☆10May 14, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- An API for VoiceCraft.☆25Jun 27, 2024Updated last year
- ☆83Feb 28, 2025Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Mar 12, 2024Updated last year
- ☆92Dec 9, 2025Updated 2 months ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Jul 30, 2024Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆12Jan 19, 2024Updated 2 years ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- ☆17Apr 22, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- ☆18Jul 12, 2025Updated 7 months ago
- A bot that checks your grammar and phrasing using LLM of choice☆32Feb 6, 2025Updated last year
- paper-read-notes☆13Sep 26, 2024Updated last year
- A QT GUI for large language models☆40Dec 27, 2023Updated 2 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- ☆14Jan 14, 2023Updated 3 years ago
- An F/OSS solution combining AI with Wikipedia knowledge via a RAG pipeline☆95Jan 12, 2025Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- DOD or data oriented design development, what is it and how to do it☆29Updated this week
- ☆15Feb 1, 2025Updated last year
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆17Feb 22, 2026Updated last week
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.☆29Jul 18, 2024Updated last year
- A Qt GUI for large language models☆45Nov 17, 2023Updated 2 years ago
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆14Jan 23, 2024Updated 2 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Dec 19, 2023Updated 2 years ago
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Writing Extension for Text Generation WebUI☆66Aug 7, 2025Updated 6 months ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆68Jul 22, 2024Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆619Feb 17, 2025Updated last year