usefulsensors / useful-transformers
Efficient Inference of Transformer models
☆427Updated 7 months ago
Alternatives and similar repositories for useful-transformers:
Users who are interested in useful-transformers are comparing it to the libraries listed below
- Run Large Language Models on RK3588 with GPU-acceleration☆96Updated last year
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆88Updated this week
- Streaming TTS based on Piper with optional RK3588 NPU support☆73Updated 3 months ago
- Reverse engineering the rk3588 npu☆75Updated 9 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆787Updated 4 months ago
- ☆686Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervision☆76Updated last year
- Easy usage of Rockchip's NPUs found in RK3588 and similar chips☆136Updated 2 weeks ago
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆123Updated last week
- A ggml (C++) re-implementation of tortoise-tts☆178Updated 7 months ago
- ☆710Updated last year
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆263Updated 11 months ago
- LLaMa/RWKV onnx models, quantization and testcase☆359Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆486Updated 7 months ago
- Port of Meta's Encodec in C/C++☆216Updated 3 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- TinyChatEngine: On-Device LLM Inference Library☆826Updated 8 months ago
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆219Updated 7 months ago
- ☆1,025Updated last year
- OpenAI Whisper for edge devices☆124Updated 2 years ago
- An innovative library for efficient LLM inference via low-bit quantization☆351Updated 6 months ago
- ggml implementation of BERT☆486Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,491Updated this week
- Pybind11 bindings for Whisper.cpp☆328Updated 3 months ago
- Open source repo for AI in a Box.☆63Updated 11 months ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 6 months ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android☆388Updated last month
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆554Updated this week
- openvino version of openai/whisper☆166Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆480Updated last year