moonshine-ai / useful-transformers
Efficient Inference of Transformer models
☆478Updated last year
Alternatives and similar repositories for useful-transformers
Users who are interested in useful-transformers are comparing it to the libraries listed below
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆114Updated 2 months ago
- Streaming TTS based on Piper with optional RK3588 NPU support☆118Updated 8 months ago
- Reverse engineering the RK3588 NPU☆108Updated last year
- Run Large Language Models on RK3588 with GPU-acceleration☆121Updated 2 years ago
- OpenAI Whisper for edge devices☆133Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆167Updated 5 months ago
- ggml implementation of BERT☆499Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆850Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆90Updated 2 years ago
- top-like script for Rockchip NPUs on Linux☆63Updated 2 months ago
- Port of Meta's Encodec in C/C++☆227Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆128Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆305Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆546Updated 7 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆220Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆257Updated 3 years ago
- Pybind11 bindings for Whisper.cpp☆343Updated last year
- Open source repo for AI in a Box.☆71Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- A "large" language model running on a microcontroller☆546Updated 2 years ago
- Pybind11 bindings for Whisper.cpp☆62Updated 2 weeks ago
- Python bindings for ggml☆146Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆249Updated last year
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆271Updated last year
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆344Updated last year
- GGUF implementation in C as a library and a tools CLI program☆298Updated 4 months ago
- An innovative library for efficient LLM inference via low-bit quantization☆351Updated last year
- On-device LLM Inference Powered by X-Bit Quantization☆276Updated this week