moonshine-ai / useful-transformersLinks
Efficient Inference of Transformer models
☆439Updated 11 months ago
Alternatives and similar repositories for useful-transformers
Users that are interested in useful-transformers are comparing it to the libraries listed below
Sorting:
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆97Updated 2 weeks ago
- Streaming TTS based on Piper with optional RK3588 NPU support☆95Updated 2 months ago
- Run Large Language Models on RK3588 with GPU-acceleration☆112Updated last year
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆138Updated last month
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆831Updated 8 months ago
- Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs☆164Updated last month
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 10 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆83Updated last year
- top-like script for rockhip NPUs on linux☆45Updated 2 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆120Updated 2 years ago
- CLIP inference in plain C/C++ with no extra dependencies☆508Updated 3 weeks ago
- OpenAI Whisper for edge devices☆127Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆288Updated last year
- On-device LLM Inference Powered by X-Bit Quantization☆255Updated last month
- ggml implementation of BERT☆494Updated last year
- Pybind11 bindings for Whisper.cpp☆333Updated 7 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆292Updated 8 months ago
- Port of Meta's Encodec in C/C++☆226Updated 7 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆210Updated last year
- An innovative library for efficient LLM inference via low-bit quantization☆349Updated 10 months ago
- Pybind11 bindings for Whisper.cpp☆58Updated 2 weeks ago
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆239Updated 10 months ago
- Speech-to-text server framework with next-gen Kaldi☆735Updated this week
- whisper.cpp bindings for python☆98Updated last year
- Python bindings for ggml☆142Updated 10 months ago
- Python bindings for whisper.cpp☆275Updated 2 weeks ago
- The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.)…☆733Updated 2 weeks ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- TinyChatEngine: On-Device LLM Inference Library☆871Updated last year
- LLaVA server (llama.cpp).☆180Updated last year