moonshine-ai / useful-transformersLinks
Efficient Inference of Transformer models
☆468Updated last year
Alternatives and similar repositories for useful-transformers
Users that are interested in useful-transformers are comparing it to the libraries listed below
Sorting:
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆108Updated this week
- Streaming TTS based on Piper with optional RK3588 NPU support☆112Updated 6 months ago
- Run Large Language Models on RK3588 with GPU-acceleration☆117Updated 2 years ago
- Reverse engineering the rk3588 npu☆99Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆847Updated last year
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆160Updated 3 months ago
- top-like script for rockhip NPUs on linux☆63Updated 2 weeks ago
- Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs☆206Updated 3 months ago
- A ggml (C++) re-implementation of tortoise-tts☆192Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆533Updated 5 months ago
- ggml implementation of BERT☆496Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆87Updated 2 years ago
- An innovative library for efficient LLM inference via low-bit quantization☆349Updated last year
- Port of Meta's Encodec in C/C++☆225Updated 11 months ago
- Pybind11 bindings for Whisper.cpp☆340Updated 11 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆298Updated last year
- OpenAI Whisper for edge devices☆132Updated 2 years ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆125Updated 2 years ago
- Pybind11 bindings for Whisper.cpp☆62Updated 2 weeks ago
- Open source repo for AI in a Box.☆68Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆256Updated 3 years ago
- ☆1,072Updated last month
- Python bindings for ggml☆146Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆527Updated 2 years ago
- whisper.cpp bindings for python☆107Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- SoTA Transformers with C-backend for fast inference on your CPU.☆308Updated last year
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆746Updated this week
- TTS support with GGML☆193Updated last month