ddh0 / easy-llamaView external linksLinks
Python package wrapping llama.cpp for on-device LLM inference
☆100Oct 12, 2025Updated 4 months ago
Alternatives and similar repositories for easy-llama
Users that are interested in easy-llama are comparing it to the libraries listed below
Sorting:
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Simple Summarizer Tool using Llama 3 8b.☆10May 14, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- An API for VoiceCraft.☆25Jun 27, 2024Updated last year
- ☆83Feb 28, 2025Updated 11 months ago
- ☆35May 9, 2024Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Mar 12, 2024Updated last year
- ☆90Dec 9, 2025Updated 2 months ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Jul 30, 2024Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆11Jan 19, 2024Updated 2 years ago
- ☆18Jul 12, 2025Updated 7 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- ☆17Apr 22, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆32Feb 6, 2025Updated last year
- A Local Proxy and Compatibility Layer for LLM Services☆11May 2, 2024Updated last year
- A browser-based tool that renders `.gguf` language model files as interactive 3D point clouds.☆38Updated this week
- ☆12Oct 23, 2022Updated 3 years ago
- paper-read-notes☆13Sep 26, 2024Updated last year
- A QT GUI for large language models☆39Dec 27, 2023Updated 2 years ago
- Web UI for ExLlamaV2☆513Feb 5, 2025Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- ☆15Feb 1, 2025Updated last year
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆16Feb 6, 2026Updated last week
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.☆29Jul 18, 2024Updated last year
- A Qt GUI for large language models☆45Nov 17, 2023Updated 2 years ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆31Feb 1, 2026Updated last week
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆13Jan 23, 2024Updated 2 years ago
- Writing Extension for Text Generation WebUI☆64Aug 7, 2025Updated 6 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Dec 19, 2023Updated 2 years ago
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆67Jul 22, 2024Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆614Feb 17, 2025Updated 11 months ago
- AI Coding Stack - Your AI Coding Ecosystem Hub.☆28Feb 3, 2026Updated last week