moonshine-ai / useful-transformersLinks
Efficient Inference of Transformer models
☆449Updated last year
Alternatives and similar repositories for useful-transformers
Users that are interested in useful-transformers are comparing it to the libraries listed below
Sorting:
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆107Updated 3 weeks ago
- Run Large Language Models on RK3588 with GPU-acceleration☆114Updated 2 years ago
- Reverse engineering the rk3588 npu☆94Updated last year
- Streaming TTS based on Piper with optional RK3588 NPU support☆103Updated 4 months ago
- Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs☆175Updated 3 weeks ago
- top-like script for rockhip NPUs on linux☆50Updated last month
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆835Updated 9 months ago
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆149Updated 3 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆84Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆293Updated last year
- ggml implementation of BERT☆492Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆187Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆517Updated 2 months ago
- ☆943Updated last month
- Port of Meta's Encodec in C/C++☆224Updated 8 months ago
- Open source repo for AI in a Box.☆66Updated last year
- Python bindings for ggml☆146Updated 11 months ago
- Pybind11 bindings for Whisper.cpp☆338Updated 8 months ago
- LLM-based code completion engine☆193Updated 7 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆217Updated last year
- OpenAI Whisper for edge devices☆129Updated 2 years ago
- openvino version of openai/whisper☆174Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆124Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- LLaVA server (llama.cpp).☆181Updated last year
- An innovative library for efficient LLM inference via low-bit quantization☆349Updated 11 months ago
- LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're exp…☆737Updated this week
- Pybind11 bindings for Whisper.cpp☆61Updated 3 weeks ago
- LLaMa/RWKV onnx models, quantization and testcase☆365Updated 2 years ago