ngxson / ggml-easyLinks
Thin wrapper around GGML to make life easier
☆34Updated this week
Alternatives and similar repositories for ggml-easy
Users that are interested in ggml-easy are comparing it to the libraries listed below
Sorting:
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆19Updated 7 months ago
- Profile your CoreML models directly from Python 🐍☆27Updated 7 months ago
- ☆22Updated last year
- Simple high-throughput inference library☆115Updated 3 weeks ago
- Port of Facebook's LLaMA model in C/C++☆21Updated last year
- Course Project for COMP4471 on RWKV☆17Updated last year
- Experiments with BitNet inference on CPU☆55Updated last year
- ☆19Updated 2 months ago
- Rust crate for some audio utilities☆23Updated 2 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 11 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- ☆28Updated 9 months ago
- AirLLM 70B inference with single 4GB GPU☆13Updated 9 months ago
- ☆15Updated 4 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆40Updated this week
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 3 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆61Updated 4 months ago
- ANE accelerated embedding models!☆17Updated 5 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆25Updated 11 months ago
- Open-source and reproducible benchmarks for Speaker Diarization☆26Updated last month
- A minimalistic C++ Jinja templating engine for LLM chat templates☆153Updated 3 weeks ago
- ☆9Updated last year
- ☆18Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆67Updated 11 months ago
- Simple LLM inference server☆20Updated 11 months ago