moonshine-ai / useful-transformers
Efficient Inference of Transformer models
☆436Updated 10 months ago
Alternatives and similar repositories for useful-transformers
Users who are interested in useful-transformers are comparing it to the libraries listed below.
- Streaming TTS based on Piper with optional RK3588 NPU support☆93Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆826Updated 7 months ago
- Run Large Language Models on RK3588 with GPU-acceleration☆105Updated last year
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆95Updated 3 weeks ago
- Reverse engineering the rk3588 npu☆84Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆81Updated last year
- Speech-to-text server framework with next-gen Kaldi☆709Updated this week
- ☆827Updated last month
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆236Updated 10 months ago
- The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.)…☆719Updated last week
- OpenAI Whisper for edge devices☆126Updated 2 years ago
- ggml implementation of BERT☆493Updated last year
- ☆741Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆186Updated 10 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆99Updated 7 months ago
- CLIP inference in plain C/C++ with no extra dependencies☆502Updated last week
- LLaMa/RWKV onnx models, quantization and testcase☆363Updated last year
- Port of Meta's Encodec in C/C++☆223Updated 6 months ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,529Updated 2 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆288Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆120Updated 2 years ago
- whisper.cpp bindings for python☆98Updated last year
- ☆995Updated last year
- Pybind11 bindings for Whisper.cpp☆332Updated 6 months ago
- Python bindings for whisper.cpp☆266Updated last week
- An innovative library for efficient LLM inference via low-bit quantization☆349Updated 9 months ago
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-…☆585Updated this week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆497Updated last year
- Whisper with Medusa heads☆842Updated 3 weeks ago