SearchSavior / OpenArc
Lightweight Inference server for OpenVINO
☆156Updated this week
Alternatives and similar repositories for OpenArc:
Users that are interested in OpenArc are comparing it to the libraries listed below
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆56Updated 2 months ago
- Turns devices into a scalable LLM platform☆128Updated this week
- ☆58Updated this week
- ☆68Updated last month
- Easy to use interface for the Whisper model optimized for all GPUs!☆108Updated this week
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆235Updated 2 months ago
- ☆84Updated 4 months ago
- ☆169Updated this week
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 7 months ago
- A Discord bot for large language models. Add Gemini, Sonnet, GPT, and other models. Easily change models, edit prompts, and enable web se…☆80Updated this week
- GPU Power and Performance Manager☆58Updated 6 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆292Updated this week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆124Updated last week
- Easily view and modify JSON datasets for large language models☆73Updated last month
- Local LLM Server with NPU Acceleration☆156Updated last week
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆73Updated 4 months ago
- ☆198Updated last week
- The Fastest Way to Fine-Tune LLMs Locally☆293Updated last month
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆544Updated last week
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆26Updated 2 months ago
- A open webui function for better R1 experience☆80Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 7 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 3 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- AI management tool☆114Updated 5 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆150Updated 11 months ago
- Orpheus Chat WebUI☆49Updated 3 weeks ago
- ☆46Updated 2 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆199Updated last week
- automatically quant GGUF models☆167Updated last week