Python package wrapping llama.cpp for on-device LLM inference
☆106 · Apr 2, 2026 · Updated 2 months ago
Alternatives and similar repositories for easy-llama
Users who are interested in easy-llama are comparing it to the libraries listed below.
- An extension to Oobabooga to add a simple memory function for chat ☆25 · Jun 5, 2023 · Updated 3 years ago
- An API for VoiceCraft. ☆25 · Jun 27, 2024 · Updated last year
- ☆97 · Mar 28, 2026 · Updated 2 months ago
- ☆83 · Feb 28, 2025 · Updated last year
- ☆35 · May 9, 2024 · Updated 2 years ago
- ☆141 · Jun 2, 2026 · Updated last week
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- A bot that checks your grammar and phrasing using an LLM of your choice ☆33 · Feb 6, 2025 · Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML based GGUF files. ☆52 · Jul 30, 2024 · Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆45 · Mar 21, 2024 · Updated 2 years ago
- Llama.cui is a small llama.cpp-based chat application for Node.js ☆20 · Jul 10, 2025 · Updated 11 months ago
- Web UI for ExLlamaV2 ☆512 · Feb 5, 2025 · Updated last year
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot. ☆30 · Jul 18, 2024 · Updated last year
- A Qt GUI for large language models ☆40 · Dec 27, 2023 · Updated 2 years ago
- ChatGPT CSS style ☆14 · Apr 28, 2024 · Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated 2 years ago
- Run Ollama & GGUF easily with a single command ☆52 · May 15, 2024 · Updated 2 years ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆643 · Mar 9, 2026 · Updated 3 months ago
- A small collection of nodes intended for use with Lodestone Rock's Chroma model, for ComfyUI. ☆14 · Jul 8, 2025 · Updated 11 months ago
- Web page with political compass quiz results for open LLMs ☆38 · Jan 31, 2024 · Updated 2 years ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma… ☆18 · Jun 9, 2024 · Updated 2 years ago
- Writing Extension for Text Generation WebUI ☆67 · Aug 7, 2025 · Updated 10 months ago
- LLM-backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip… ☆67 · Jul 22, 2024 · Updated last year
- A library for working with GBNF files ☆30 · May 27, 2026 · Updated 2 weeks ago
- An F/OSS solution combining AI with Wikipedia knowledge via a RAG pipeline ☆104 · Jan 12, 2025 · Updated last year
- ☆16 · Feb 1, 2025 · Updated last year
- AI-based "Happiness Optimizer" ☆12 · Oct 20, 2024 · Updated last year
- Data-oriented design (DOD) development: what it is and how to do it ☆35 · Updated this week
- A browser-based tool that renders `.gguf` language model files as interactive 3D point clouds. ☆55 · Feb 8, 2026 · Updated 4 months ago
- ☆42 · Aug 2, 2025 · Updated 10 months ago
- A simple frontend page to interact with an OpenAI-like API ☆16 · Jan 31, 2025 · Updated last year
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models. ☆33 · Jul 14, 2023 · Updated 2 years ago
- Give your local LLM a real memory with a lightweight, fully local memory system. 100% offline and under your control. ☆77 · Sep 16, 2025 · Updated 8 months ago
- Groquments is a simple demonstration project showcasing how easily PocketGroq can help developers integrate Groq's powerful AI capabiliti… ☆12 · Sep 19, 2024 · Updated last year
- A portable linker for multiple file formats. ☆14 · Aug 28, 2023 · Updated 2 years ago
- ☆12 · Oct 23, 2022 · Updated 3 years ago
- A Docker Compose file to run the Archiveteam warrior while preserving the settings between updates. ☆15 · Feb 8, 2023 · Updated 3 years ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆25 · Dec 20, 2024 · Updated last year