usefulsensors / useful-transformers
Efficient Inference of Transformer models
☆433 Updated 9 months ago
Alternatives and similar repositories for useful-transformers
Users that are interested in useful-transformers are comparing it to the libraries listed below
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend ☆94 Updated last week
- Streaming TTS based on Piper with optional RK3588 NPU support ☆89 Updated last month
- Reverse engineering the rk3588 npu ☆83 Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆81 Updated last year
- Run Large Language Models on RK3588 with GPU-acceleration ☆104 Updated last year
- CLIP inference in plain C/C++ with no extra dependencies ☆498 Updated 9 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆285 Updated last year
- ☆803 Updated 2 weeks ago
- ggml implementation of BERT ☆492 Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆824 Updated 6 months ago
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series ☆132 Updated last week
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 Updated 2 years ago
- Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs ☆149 Updated this week
- A ggml (C++) re-implementation of tortoise-tts ☆184 Updated 9 months ago
- OpenAI Whisper for edge devices ☆125 Updated 2 years ago
- Pybind11 bindings for Whisper.cpp ☆330 Updated 5 months ago
- ☆738 Updated last year
- ONNX implementation of Whisper. PyTorch free. ☆97 Updated 6 months ago
- An innovative library for efficient LLM inference via low-bit quantization ☆348 Updated 9 months ago
- Python bindings for ggml ☆140 Updated 9 months ago
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-… ☆469 Updated this week
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices ☆235 Updated 9 months ago
- LLaMa/RWKV onnx models, quantization and testcase ☆363 Updated last year
- ☆133 Updated 11 months ago
- Run generative AI models in sophgo BM1684X/BM1688 ☆216 Updated this week
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆244 Updated 2 years ago
- Port of Meta's Encodec in C/C++ ☆220 Updated 6 months ago
- Speech-to-text server framework with next-gen Kaldi ☆697 Updated last week
- whisper.cpp bindings for python ☆96 Updated last year
- openvino version of openai/whisper ☆166 Updated last year