menloresearch / cortex.llamacpp
cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
☆42 · Updated 3 weeks ago
Alternatives and similar repositories for cortex.llamacpp
Users who are interested in cortex.llamacpp are comparing it to the libraries listed below.
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆43 · Updated 9 months ago
- Yet Another (LLM) Web UI, made with Gemini ☆12 · Updated 6 months ago
- ☆31 · Updated last year
- Thin wrapper around GGML to make life easier ☆35 · Updated 3 weeks ago
- Something similar to Apple Intelligence? ☆61 · Updated 11 months ago
- TTS support with GGML ☆117 · Updated last week
- Running Microsoft's BitNet via Electron, React & Astro ☆40 · Updated 3 weeks ago
- Port of Suno AI's Bark in C/C++ for fast inference ☆52 · Updated last year
- ☆21 · Updated 4 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆55 · Updated last year
- LLM inference in C/C++ ☆77 · Updated this week
- Experiments with BitNet inference on CPU ☆54 · Updated last year
- Local LLM inference & management server with built-in OpenAI API ☆31 · Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX. ☆86 · Updated this week
- AirLLM 70B inference with single 4GB GPU ☆14 · Updated this week
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI ☆56 · Updated 6 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 · Updated last year
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture ☆30 · Updated 2 weeks ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆67 · Updated last week
- ☆57 · Updated 10 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 · Updated last year
- Course Project for COMP4471 on RWKV ☆17 · Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆187 · Updated 10 months ago
- On-device streaming text-to-speech engine powered by deep learning ☆87 · Updated this week
- ☆95 · Updated 6 months ago
- kokoro text to speech using javascript ☆58 · Updated 4 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆21 · Updated 9 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint. ☆41 · Updated 9 months ago
- A random walk voice style cloning application for Kokoro text to speech ☆99 · Updated last week
- run ollama & gguf easily with a single command ☆51 · Updated last year