inferx-net/inferx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/inferx-net/inferx)

inferx-net / inferx

InferX: Inference as a Service Platform

☆213

Alternatives and similar repositories for inferx

Users that are interested in inferx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TesslateAI / TFrameX
View on GitHub
☆179Aug 10, 2025Updated 9 months ago
Cybonto / OllaDeck
View on GitHub
OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both …
☆17Sep 21, 2024Updated last year
AaronFeng753 / Qwen3-Gemini2.5
View on GitHub
Make Qwen3 Think like Gemini 2.5 Pro | Open webui function
☆25May 10, 2025Updated last year
TesslateAI / Agent-Builder
View on GitHub
☆213Sep 7, 2025Updated 8 months ago
thustorage / RoundPipe
View on GitHub
Large DNNs training framework for consumer GPUs
☆78Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
multiplexerai / Namespace-RAG
View on GitHub
☆13Feb 18, 2024Updated 2 years ago
bjodah / llm-multi-backend-container
View on GitHub
Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap
☆18Apr 10, 2026Updated last month
idoh / fast_mamba.np
View on GitHub
A pure and fast NumPy implementation of Mamba with cache support.
☆18Jun 16, 2024Updated last year
mostlygeek / llama-swap
View on GitHub
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
☆4,107Updated this week
JordanDalton / pm
View on GitHub
Vibe Coded Project Management System
☆21Apr 19, 2025Updated last year
Anonymous1252022 / fp4-all-the-way
View on GitHub
☆49May 20, 2025Updated last year
RobTand / prismaquant
View on GitHub
Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors e…
☆69Updated this week
theroyallab / YALS
View on GitHub
☆97Mar 28, 2026Updated last month
Roy3838 / Observer
View on GitHub
☆1,361May 12, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
intelligencedev / manifold
View on GitHub
Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.
☆493May 14, 2026Updated last week
PasiKoodaa / Chat-with-Screen
View on GitHub
☆20Sep 28, 2024Updated last year
efogdev / apollo
View on GitHub
Deploy Apollo HF space locally
☆40Dec 16, 2024Updated last year
av / harbor
View on GitHub
Stop configuring your AI stack. Start using it. One command brings a complete pre-wired LLM stack with hundreds of services to explore.
☆2,946Updated this week
llm-agent-x / llm-agent-x
View on GitHub
☆21Dec 9, 2025Updated 5 months ago
willmil11 / cleanai-c
View on GitHub
Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)
☆16Mar 6, 2026Updated 2 months ago
Contextualist / lone-arena
View on GitHub
Self-hosted LLM chatbot arena, with yourself as the only judge
☆41Feb 6, 2024Updated 2 years ago
bdytx5 / open_answer_engine
View on GitHub
☆22Aug 9, 2024Updated last year
abgulati / hf-waitress
View on GitHub
Serving LLMs in the HF-Transformers format via a PyFlask API
☆72Sep 10, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ikawrakow / ik_llama.cpp
View on GitHub
llama.cpp fork with additional SOTA quants and improved performance
☆2,448Updated this week
PuchToTalk / DOOM-MistralAI
View on GitHub
Mistral7B playing DOOM
☆29Mar 27, 2024Updated 2 years ago
h2210316651 / lexicrawler
View on GitHub
LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…
☆48Feb 27, 2025Updated last year
adriancable / qwen3.c
View on GitHub
Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.
☆175Jul 5, 2025Updated 10 months ago
MaggotHATE / Llama_chat
View on GitHub
A chat UI for Llama.cpp
☆16May 13, 2026Updated last week
intentee / llmops-handbook
View on GitHub
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…
☆82Aug 16, 2024Updated last year
Sumrix / LiteRP
View on GitHub
Minimal web client for chatting and roleplay with AI characters
☆26Aug 21, 2025Updated 9 months ago
ScalingIntelligence / good-kernels
View on GitHub
Samples of good AI generated CUDA kernels
☆104May 30, 2025Updated 11 months ago
premAI-io / Ayup
View on GitHub
Quickly and securely turn any Linux box into a build and deployment assistant
☆25Oct 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kleinlee / MiniQwen
View on GitHub
☆14Dec 6, 2023Updated 2 years ago
createthis / diffcalculia
View on GitHub
☆16May 8, 2025Updated last year
SicariusSicariiStuff / SLOP_Detector
View on GitHub
SLOP Detector and analyzer based on dictionary for shareGPT JSON and text
☆98Apr 2, 2026Updated last month
MVPandey / DTS
View on GitHub
🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…
☆36Jan 18, 2026Updated 4 months ago
twerkmeister / tokenflood
View on GitHub
Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs
☆45May 12, 2026Updated last week
foldl / WritingTools
View on GitHub
Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …
☆28Jul 26, 2025Updated 9 months ago
ideaweaver-ai / DeepSeek-Children-Stories-15M-model
View on GitHub
☆114Jun 19, 2025Updated 11 months ago