ggerganov / bark.cpp
Port of Suno AI's Bark in C/C++ for fast inference
☆50Updated 5 months ago
Related projects: ⓘ
- LLaVA server (llama.cpp).☆173Updated 10 months ago
- Local ML voice chat using high-end models.☆138Updated last week
- A ggml (C++) re-implementation of tortoise-tts☆147Updated 3 weeks ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated 11 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated 6 months ago
- Port of Meta's Encodec in C/C++☆187Updated last month
- llama.cpp clone with additional SOTA quants and improved CPU performance☆57Updated this week
- Stable Diffusion in pure C/C++☆56Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 7 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆40Updated 2 weeks ago
- GRDN.AI app for garden optimization☆68Updated 7 months ago
- Web browser version of StarCoder.cpp☆43Updated last year
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆31Updated 9 months ago
- WebAssembly binding for llama.cpp - Enabling in-browser LLM inference☆342Updated last week
- ☆79Updated 2 months ago
- Pybind11 bindings for Whisper.cpp☆38Updated 2 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆44Updated 10 months ago
- WebGPU LLM inference tuned by hand☆145Updated last year
- Python bindings for ggml☆125Updated 2 weeks ago
- Course Project for COMP4471 on RWKV☆16Updated 7 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆97Updated 4 months ago
- Inference of Mamba models in pure C☆176Updated 6 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆35Updated last week
- Port of Facebook's LLaMA model in C/C++☆31Updated 6 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆15Updated this week
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆45Updated 10 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆126Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 3 months ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 weeks ago
- ☆55Updated last month