SearchSavior / OpenArc
Lightweight Inference server for OpenVINO
☆187 Updated last week
Alternatives and similar repositories for OpenArc
Users that are interested in OpenArc are comparing it to the libraries listed below
- ☆78 Updated this week
- Open source LLM UI, compatible with all local LLM providers. ☆174 Updated 9 months ago
- InferX is an Inference Function as a Service Platform ☆111 Updated this week
- llama.cpp fork with additional SOTA quants and improved performance ☆608 Updated this week
- Easy to use interface for the Whisper model optimized for all GPUs! ☆225 Updated this week
- Turns devices into a scalable LLM platform ☆138 Updated this week
- An OpenAI API-compatible API for chat with image input and questions about the images, aka multimodal. ☆255 Updated 3 months ago
- ☆79 Updated 3 months ago
- Local LLM Server with GPU and NPU Acceleration ☆138 Updated this week
- ☆103 Updated last month
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆67 Updated this week
- ☆204 Updated last month
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support ☆221 Updated last week
- The Fastest Way to Fine-Tune LLMs Locally ☆306 Updated 3 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer ☆243 Updated 4 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆408 Updated this week
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you… ☆219 Updated 2 weeks ago
- ☆129 Updated last month
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs. ☆244 Updated 2 weeks ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆76 Updated 9 months ago
- Orpheus Chat WebUI ☆65 Updated 2 months ago
- ☆145 Updated last month
- ☆204 Updated last month
- Eternal is an experimental platform for machine learning models and workflows. ☆68 Updated 3 months ago
- ☆94 Updated 6 months ago
- GPU Power and Performance Manager ☆59 Updated 8 months ago
- MAESTRO is an AI-powered research application designed to streamline complex research tasks. ☆158 Updated last week
- Sparse Inferencing for transformer based LLMs ☆183 Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆28 Updated 5 months ago
- A web application that converts speech to speech, 100% private ☆71 Updated 3 weeks ago