menloresearch / cortex.llamacpp
cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
☆36Updated this week
Alternatives and similar repositories for cortex.llamacpp:
Users that are interested in cortex.llamacpp are comparing it to the libraries listed below
- Port of Suno AI's Bark in C/C++ for fast inference☆53Updated 11 months ago
- ☆31Updated last year
- TTS support with GGML☆26Updated last month
- Yet Another (LLM) Web UI, made with Gemini☆11Updated 3 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 6 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆55Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆32Updated 8 months ago
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 7 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆25Updated 2 months ago
- Spotlight-like client for Ollama on Windows.☆27Updated 10 months ago
- AirLLM 70B inference with single 4GB GPU☆12Updated 7 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- A chat UI for Llama.cpp☆12Updated this week
- ☆52Updated this week
- Something similar to Apple Intelligence?☆59Updated 8 months ago
- ☆91Updated 2 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 9 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 4 months ago
- 👁️ Multimodal LLM vision multitool☆26Updated 5 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆16Updated 7 months ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆45Updated 8 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆38Updated 7 months ago
- Experiments with BitNet inference on CPU☆53Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆51Updated 5 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 9 months ago
- CI for ggml and related projects☆25Updated this week
- ☆24Updated 2 months ago