usefulsensors / useful-transformers

Efficient Inference of Transformer models

☆433

Alternatives and similar repositories for useful-transformers

Users who are interested in useful-transformers are comparing it to the libraries listed below

marty1885 / llama.cpp
My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend
☆94Updated last week
marty1885 / paroli
Streaming TTS based on Piper with optional RK3588 NPU support
☆89Updated last month
mtx512 / rk3588-npu
Reverse engineering the rk3588 npu
☆83Updated last year
usefulsensors / openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆81Updated last year
Chrisz236 / llm-rk3588
Run Large Language Models on RK3588 with GPU-acceleration
☆104Updated last year
monatis / clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
☆498Updated 9 months ago
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆285Updated last year
airockchip / rknn-llm
☆803Updated 2 weeks ago
skeskinen / bert.cpp
ggml implementation of BERT
☆492Updated last year
PABannier / bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation
☆824Updated 6 months ago
Pelochus / ezrknn-llm
Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series
☆132Updated last week
fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆60Updated 2 years ago
Pelochus / ezrknpu
Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs
☆149Updated this week
balisujohn / tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
☆184Updated 9 months ago
maxbbraun / whisper-edge
OpenAI Whisper for edge devices
☆125Updated 2 years ago
aarnphm / whispercpp
Pybind11 bindings for Whisper.cpp
☆330Updated 5 months ago
rockchip-linux / rknpu2
☆738Updated last year
PINTO0309 / whisper-onnx-cpu
ONNX implementation of Whisper. PyTorch free.
☆97Updated 6 months ago
intel / neural-speed
An innovative library for efficient LLM inference via low-bit quantization
☆348Updated 9 months ago
abetlen / ggml-python
Python bindings for ggml
☆140Updated 9 months ago
google-ai-edge / LiteRT
LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-…
☆469Updated this week
nyadla-sys / whisper.tflite
Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
☆235Updated 9 months ago
tpoisonooo / llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
☆363Updated last year
futo-org / whisper-acft
☆133Updated 11 months ago
sophgo / LLM-TPU
Run generative AI models in sophgo BM1684X/BM1688
☆216Updated this week
MiscellaneousStuff / openai-whisper-cpu
Improving transcription performance of OpenAI Whisper for CPU based deployment
☆244Updated 2 years ago
PABannier / encodec.cpp
Port of Meta's Encodec in C/C++
☆220Updated 6 months ago
k2-fsa / sherpa
Speech-to-text server framework with next-gen Kaldi
☆697Updated last week
carloscdias / whisper-cpp-python
whisper.cpp bindings for python
☆96Updated last year
zhuzilin / whisper-openvino
openvino version of openai/whisper
☆166Updated last year