cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
☆42Jul 4, 2025Updated 7 months ago
Alternatives and similar repositories for cortex.llamacpp
Users that are interested in cortex.llamacpp are comparing it to the libraries listed below
Sorting:
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Sep 26, 2024Updated last year
- Run Stable diffusion 3 on low VRAM systems☆29Jun 13, 2024Updated last year
- Image synthesis using machine learning☆23May 6, 2025Updated 9 months ago
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- A chat UI for Llama.cpp☆15Dec 2, 2025Updated 3 months ago
- Minimalistic batching application for LLMs using ASP.NET Core and LLamaSharp☆12Oct 23, 2024Updated last year
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Recording models☆12Sep 19, 2023Updated 2 years ago
- Your Python AI Coder!☆36May 21, 2025Updated 9 months ago
- ☆17Feb 18, 2025Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆42Jul 18, 2024Updated last year
- Inference Llama 2 in one file of pure C☆12Nov 17, 2023Updated 2 years ago
- LibreTranslate C++ bindings☆18Aug 27, 2021Updated 4 years ago
- Stable Diffusion in pure C/C++☆16Jan 11, 2026Updated last month
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Apr 16, 2025Updated 10 months ago
- A micro LLM multi-agent system for data analysis☆17Apr 27, 2025Updated 10 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆53Mar 15, 2025Updated 11 months ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆45May 13, 2025Updated 9 months ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Oct 24, 2024Updated last year
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆52Jan 30, 2024Updated 2 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Jul 26, 2023Updated 2 years ago
- 2UTM - до 10ти УТМ на одной машине☆12Oct 25, 2025Updated 4 months ago
- Local AI API Platform☆2,759Jul 4, 2025Updated 7 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 2 weeks ago
- segment anything(SAM) for CPP Inference☆31Jun 11, 2024Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 8 months ago
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated last week
- automatically quant GGUF models☆220Dec 23, 2025Updated 2 months ago
- Lightweight, standalone, multi-platform, and privacy focused local LLM chat interface with optional encryption☆154Apr 26, 2025Updated 10 months ago
- Golang web client for Ollama, fast and easy to use.☆32Jul 18, 2025Updated 7 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Jul 13, 2025Updated 7 months ago
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 4 months ago
- ProSuite Source Code☆10Feb 12, 2026Updated 2 weeks ago
- LCM Drawing app☆12Dec 1, 2023Updated 2 years ago
- call rwkv v4/v5/v6/v7 raven/world/finch 1B5-14B rwkv.cpp using csharp cpu/gpu (support INT4,8,Float16,32)☆35Feb 21, 2025Updated last year
- ONNX Command-Line Toolbox☆35Oct 11, 2024Updated last year
- Best Movie App with Ionic 4 using The Movie DB API☆16May 24, 2019Updated 6 years ago