Python package wrapping llama.cpp for on-device LLM inference
☆105Apr 2, 2026Updated last month
Alternatives and similar repositories for easy-llama
Users that are interested in easy-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆23Jan 5, 2026Updated 4 months ago
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- An API for VoiceCraft.☆25Jun 27, 2024Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- ☆94Mar 28, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆83Feb 28, 2025Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆32Feb 6, 2025Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆45Mar 21, 2024Updated 2 years ago
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 9 months ago
- Web UI for ExLlamaV2☆511Feb 5, 2025Updated last year
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.☆29Jul 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Server plugin to extract text from Office documents using the officeparser library.☆13Mar 14, 2026Updated last month
- A QT GUI for large language models☆40Dec 27, 2023Updated 2 years ago
- ChatGPT CSS style☆14Apr 28, 2024Updated 2 years ago
- A simple character editor for v2 Tavern Character Cards☆62Jan 18, 2025Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆630Mar 9, 2026Updated last month
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- Writing Extension for Text Generation WebUI☆66Aug 7, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- A Qt GUI for large language models☆45Nov 17, 2023Updated 2 years ago
- An open source collection of agentic Github workflows☆24May 7, 2024Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆67Jul 22, 2024Updated last year
- DOD or data oriented design development, what is it and how to do it☆34Updated this week
- A browser-based tool that renders `.gguf` language model files as interactive 3D point clouds.☆53Feb 8, 2026Updated 2 months ago
- ☆43Aug 2, 2025Updated 9 months ago
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆33Jul 14, 2023Updated 2 years ago
- Give your local LLM a real memory with a lightweight, fully local memory system. 100% offline and under your control.☆73Sep 16, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A portable linker for multiple file formats.☆14Aug 28, 2023Updated 2 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- Template Bracket 2.0☆10Dec 14, 2019Updated 6 years ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- Teaching AI to play the classic text adventure Zork using Large Language Models☆37Apr 5, 2026Updated last month
- LLM inference in C/C++☆23Oct 4, 2024Updated last year
- automatically quant GGUF models☆224Dec 23, 2025Updated 4 months ago