Python package wrapping llama.cpp for on-device LLM inference
☆101Oct 12, 2025Updated 5 months ago
Alternatives and similar repositories for easy-llama
Users that are interested in easy-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Simple Summarizer Tool using Llama 3 8b.☆10May 14, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An API for VoiceCraft.☆25Jun 27, 2024Updated last year
- ☆93Dec 9, 2025Updated 3 months ago
- ☆35May 9, 2024Updated last year
- ☆83Feb 28, 2025Updated last year
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆32Feb 6, 2025Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Web UI for ExLlamaV2☆510Feb 5, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆45Mar 21, 2024Updated 2 years ago
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 8 months ago
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Mar 12, 2024Updated 2 years ago
- A QT GUI for large language models☆40Dec 27, 2023Updated 2 years ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆624Mar 9, 2026Updated 2 weeks ago
- ChatGPT CSS style☆14Apr 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated last year
- DOD or data oriented design development, what is it and how to do it☆31Updated this week
- Web page with political compass quiz results for open LLMs☆38Jan 31, 2024Updated 2 years ago
- Writing Extension for Text Generation WebUI☆66Aug 7, 2025Updated 7 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Dec 19, 2023Updated 2 years ago
- A Qt GUI for large language models☆45Nov 17, 2023Updated 2 years ago
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- An open source collection of agentic Github workflows☆24May 7, 2024Updated last year
- ☆30May 30, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆68Jul 22, 2024Updated last year
- A library for working with GBNF files☆29Nov 2, 2025Updated 4 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- ☆15Feb 1, 2025Updated last year
- A browser-based tool that renders `.gguf` language model files as interactive 3D point clouds.☆53Feb 8, 2026Updated last month
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- ☆43Aug 2, 2025Updated 7 months ago