lemonade-sdk / lemonade
Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPUs. Join our Discord: https://discord.gg/5xXzkMu8Zk
☆1,622 Updated this week
Alternatives and similar repositories for lemonade
Users who are interested in lemonade are comparing it to the libraries listed below.
- ☆2,131 Updated 2 weeks ago
- Run LLM Agents on Ryzen AI PCs in Minutes ☆744 Updated this week
- ☆1,205 Updated this week
- Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc ☆1,862 Updated last week
- Communicate with an LLM provider using a single interface ☆1,365 Updated this week
- A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒ ☆872 Updated last week
- Big & Small LLMs working together ☆1,200 Updated this week
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs. ☆451 Updated this week
- llama.cpp fork with additional SOTA quants and improved performance ☆1,329 Updated this week
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model … ☆570 Updated 2 weeks ago
- ☆524 Updated this week
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs. ☆1,027 Updated last week
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware. ☆602 Updated 3 weeks ago
- ☆477 Updated this week
- MAESTRO is an AI-powered research application designed to streamline complex research tasks. ☆1,351 Updated last month
- Semantic search and document parsing tools for the command line ☆1,443 Updated this week
- Python package and backend for the Elysia platform app. ☆1,806 Updated last week
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec… ☆607 Updated 2 weeks ago
- ☆365 Updated this week
- Docs for GGUF quantization (unofficial) ☆312 Updated 4 months ago
- Building blocks for rapid development of GenAI applications ☆1,589 Updated this week
- A command-line interface tool for serving LLM using vLLM. ☆441 Updated 3 weeks ago
- 100% Local Memory layer and Knowledge base for agents with WebUI ☆574 Updated 4 months ago
- A single interface to use and evaluate different agent frameworks ☆1,026 Updated this week
- ☆192 Updated 2 months ago
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI … ☆1,032 Updated 4 months ago
- Optimized Whisper models for streaming and on-device use ☆507 Updated last week
- Build, enrich, and transform datasets using AI models with no code ☆1,564 Updated 3 weeks ago
- Fastest LLM gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overh… ☆1,112 Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers. ☆432 Updated 2 months ago