InferX: Inference as a Service Platform
☆172 · Mar 6, 2026 · Updated this week
Alternatives and similar repositories for inferx
Users who are interested in inferx are comparing it to the libraries listed below.
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 · Jun 16, 2024 · Updated last year
- Vibe Coded Project Management System ☆21 · Apr 19, 2025 · Updated 10 months ago
- ☆20 · Sep 28, 2024 · Updated last year
- ☆177 · Aug 10, 2025 · Updated 6 months ago
- ☆207 · Sep 7, 2025 · Updated 6 months ago
- ☆22 · Aug 9, 2024 · Updated last year
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function ☆25 · May 10, 2025 · Updated 10 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆17 · Updated this week
- Deploy Apollo HF space locally ☆40 · Dec 16, 2024 · Updated last year
- Using deep research workflow to generate datasets for finetuning LLMs. ☆39 · Oct 9, 2025 · Updated 5 months ago
- ☆18 · Dec 9, 2025 · Updated 3 months ago
- Minimal web client for chatting and roleplay with AI characters ☆26 · Aug 21, 2025 · Updated 6 months ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc ☆2,716 · Mar 2, 2026 · Updated last week
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆72 · Sep 10, 2024 · Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, … ☆27 · Jul 26, 2025 · Updated 7 months ago
- ☆17 · Apr 22, 2024 · Updated last year
- ☆13 · Feb 18, 2024 · Updated 2 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a… ☆35 · Jan 18, 2026 · Updated last month
- EpochFS is a versioned cloud file system with git-like branching, transaction support. ☆17 · Feb 3, 2026 · Updated last month
- A chat UI for Llama.cpp ☆15 · Dec 2, 2025 · Updated 3 months ago
- Mistral7B playing DOOM ☆29 · Mar 27, 2024 · Updated last year
- BitTorrent Data Set ☆13 · Jan 2, 2025 · Updated last year
- ☆19 · Jul 12, 2025 · Updated 7 months ago
- Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants. ☆482 · Updated this week
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval ☆23 · Jun 28, 2025 · Updated 8 months ago
- a character-ai like UI for LLM ☆10 · Dec 3, 2024 · Updated last year
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap ☆17 · Feb 22, 2026 · Updated 2 weeks ago
- Good enough PDF parser for CPU ☆15 · Aug 9, 2024 · Updated last year
- 🔍📃 LLM-powered PDF Table Extractor ☆19 · Jun 26, 2025 · Updated 8 months ago
- ☆17 · Apr 7, 2025 · Updated 11 months ago
- A slight upgrade from v2.1 which includes catppuccin theme light and dark ☆12 · Jun 2, 2024 · Updated last year
- ☆1,262 · Updated this week
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT ☆212 · Jun 17, 2025 · Updated 8 months ago
- Efficient non-uniform quantization with GPTQ for GGUF ☆61 · Sep 17, 2025 · Updated 5 months ago
- Creates an index of images, queries a local LLM and adds tags to the image metadata ☆339 · Feb 5, 2026 · Updated last month
- ☆14 · Dec 6, 2023 · Updated 2 years ago
- OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both … ☆18 · Sep 21, 2024 · Updated last year
- Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the … ☆16 · Oct 4, 2023 · Updated 2 years ago
- Creating diff that supports wildcard produced by LLMs ☆16 · Sep 18, 2024 · Updated last year