cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
☆44 · Jul 4, 2025 · Updated 11 months ago
Alternatives and similar repositories for cortex.llamacpp
Users that are interested in cortex.llamacpp are comparing it to the libraries listed below.
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆42 · Sep 26, 2024 · Updated last year
- Win32 native frontend for llama-cli ☆14 · Nov 2, 2024 · Updated last year
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale. A powerful web crawling library that respects robots.txt and rate… ☆24 · Apr 3, 2025 · Updated last year
- Jan.ai Website & Documentation ☆38 · Oct 14, 2024 · Updated last year
- Local AI API Platform ☆2,757 · Jul 4, 2025 · Updated 11 months ago
- Simple agent framework using Ollama tool calling ☆10 · Aug 27, 2024 · Updated last year
- Run Stable Diffusion 3 on low-VRAM systems ☆29 · Jun 13, 2024 · Updated last year
- Your Python AI Coder! ☆36 · May 21, 2025 · Updated last year
- C++ pipeline with OpenVINO native API for Stable Diffusion v1.5 ☆13 · Feb 23, 2024 · Updated 2 years ago
- Repository sifter and hardlinker ☆13 · Jun 13, 2020 · Updated 5 years ago
- ☆18 · Feb 18, 2025 · Updated last year
- A chat UI for Llama.cpp ☆16 · Jun 4, 2026 · Updated last week
- ☆27 · Mar 17, 2025 · Updated last year
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio… ☆15 · Mar 16, 2024 · Updated 2 years ago
- ☆10 · Nov 24, 2024 · Updated last year
- ☆17 · May 18, 2026 · Updated 3 weeks ago
- Welcome to the LLM Tutorials and RAG Implementations repository! This repository provides tutorials, guides, and implementations for work… ☆13 · Jul 1, 2025 · Updated 11 months ago
- ☆11 · Feb 7, 2024 · Updated 2 years ago
- A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo. ☆18 · Jun 4, 2023 · Updated 3 years ago
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals. ☆25 · May 20, 2026 · Updated 3 weeks ago
- Recording models ☆12 · Sep 19, 2023 · Updated 2 years ago
- Minimalistic batching application for LLMs using ASP.NET Core and LLamaSharp ☆12 · Oct 23, 2024 · Updated last year
- LibreTranslate C++ bindings ☆18 · Aug 27, 2021 · Updated 4 years ago
- A deep learning model for detecting tables in photos, built with ncnn. ☆20 · Mar 2, 2025 · Updated last year
- Personal accounting tool with file conversion ☆18 · Jul 23, 2025 · Updated 10 months ago
- A proof-of-concept parser for the Prolog programming language, at the Bern University of Applied Sciences for the course "Automata and fo… ☆11 · Jan 17, 2014 · Updated 12 years ago
- This is the modified version of llama2.c LLM inference app ported to run on 32-bit capable DOS machines. ☆30 · May 23, 2025 · Updated last year
- Intuitive RAG system on top of LlamaIndex ☆15 · Nov 8, 2024 · Updated last year
- Statically typed wrappers for various markup languages: Graphviz, SVG, OpenSCAD, LaTeX & more ☆10 · Feb 15, 2022 · Updated 4 years ago
- Awesome resources for GIMP ☆22 · May 16, 2026 · Updated 3 weeks ago
- Multithreaded TCP Client/Server implementation in C++ ☆11 · Jul 20, 2022 · Updated 3 years ago
- LCM OpenVINO model converter ☆24 · Mar 27, 2024 · Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts ☆194 · Aug 20, 2024 · Updated last year
- Unicode text segmentation (tr29) ☆11 · Sep 19, 2024 · Updated last year
- search for files in a directory hierarchy ☆11 · Oct 16, 2024 · Updated last year
- Inference Llama 2 in pure Nim ☆35 · Mar 19, 2026 · Updated 2 months ago
- ☆20 · Sep 10, 2025 · Updated 9 months ago
- Draw MNIST digits and classify in real time! ☆12 · Aug 27, 2024 · Updated last year
- Stable Diffusion desktop UI for Windows ☆25 · Nov 27, 2022 · Updated 3 years ago