inferx-net / inferxLinks
InferX is a Inference Function as a Service Platform
☆119Updated last week
Alternatives and similar repositories for inferx
Users that are interested in inferx are comparing it to the libraries listed below
Sorting:
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated last month
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs.☆259Updated last month
- ☆152Updated last week
- Sparse Inferencing for transformer based LLMs☆196Updated this week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆141Updated 2 months ago
- ☆109Updated this week
- A web application that converts speech to speech 100% private☆73Updated 2 months ago
- ☆82Updated 5 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆313Updated 4 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆25Updated 3 months ago
- ☆132Updated 3 months ago
- ☆132Updated 3 months ago
- Lightweight Inference server for OpenVINO☆191Updated last week
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆78Updated 10 months ago
- ☆207Updated 2 weeks ago
- A platform to self-host AI on easy mode☆152Updated last week
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆227Updated last week
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco…☆84Updated last week
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆26Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- A simple tool to anonymize LLM prompts.☆64Updated 6 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆194Updated 2 months ago
- ☆81Updated last week
- ☆217Updated 2 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆97Updated last month
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆66Updated 9 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆39Updated 2 weeks ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆200Updated 3 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆247Updated 5 months ago
- Easily view and modify JSON datasets for large language models☆81Updated 2 months ago