lemonade-sdk / lemonadeLinks
Local LLM Server with NPU Acceleration
β36Updated this week
Alternatives and similar repositories for lemonade
Users that are interested in lemonade are comparing it to the libraries listed below
Sorting:
- No-code CLI designed for accelerating ONNX workflowsβ192Updated 3 weeks ago
- beep boop π€ (experimental)β110Updated 4 months ago
- Personnal collection of pipes and filters I use for open-webuiβ16Updated 3 weeks ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to β¦β39Updated 9 months ago
- Cognee starter repo with examplesβ24Updated 3 weeks ago
- Mixture-of-Ollamasβ30Updated 9 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask APIβ71Updated 8 months ago
- Lightweight Inference server for OpenVINOβ176Updated last week
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuningβ20Updated 2 weeks ago
- β90Updated 5 months ago
- we're cookingβ36Updated 5 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backendsβ20Updated last week
- β15Updated 5 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllmβ¦β48Updated last week
- β25Updated 2 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing thoβ¦β109Updated 3 weeks ago
- LLM Benchmark for Throughput via Ollama (Local LLMs)β231Updated this week
- *NIX SHELL with Local AI/LLM integrationβ22Updated 3 months ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernetβ¦β11Updated last year
- β14Updated 9 months ago
- InferX is a Inference Function as a Service Platformβ105Updated this week
- A high performance MCP client sdk for pythonβ17Updated last month
- This project sets up an Open WebUI interface with LiteLLM as a backend proxy for AI models. This simple setup makes adding and using variβ¦β23Updated 6 months ago
- OllamaLab: a fully fledged AI assistant utilizing Ollama with Companion Mode for MacOSβ34Updated 8 months ago
- Local drive deep search.β26Updated 3 months ago
- MCP server for connecting agentic systems to search systems via searXNGβ68Updated 3 months ago
- Docker images and configuration to run text-generation-webui with GPU or CPU supportβ29Updated last year
- Web UI and API for managing MCP Orchestrator (mcpo) instances and configurationsβ65Updated 2 weeks ago
- β71Updated last week
- GPU prices aggregator for cloud providersβ37Updated last week