usefulsensors / useful-transformers
Efficient Inference of Transformer models
☆421 · Updated 6 months ago
Alternatives and similar repositories for useful-transformers:
Users who are interested in useful-transformers are comparing it to the libraries listed below.
- Run Large Language Models on RK3588 with GPU-acceleration ☆93 · Updated last year
- ☆602 · Updated last week
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend ☆84 · Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆74 · Updated last year
- Easy usage of Rockchip's NPUs found in RK3588 and similar chips ☆124 · Updated 3 months ago
- Streaming TTS based on Piper with optional RK3588 NPU support ☆63 · Updated 2 months ago
- OpenAI Whisper for edge devices ☆123 · Updated last year
- ☆693 · Updated last year
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices ☆211 · Updated 5 months ago
- ggml implementation of BERT ☆480 · Updated 11 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆175 · Updated 6 months ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆58 · Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆781 · Updated 3 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆255 · Updated 10 months ago
- CLIP inference in plain C/C++ with no extra dependencies ☆480 · Updated 6 months ago
- Automated script to convert Huggingface and GGUF models to rkllm format for running on Rockchip NPU ☆22 · Updated 3 months ago
- Port of Meta's Encodec in C/C++ ☆215 · Updated 2 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆117 · Updated last year
- LLaMa/RWKV onnx models, quantization and testcase ☆356 · Updated last year
- top-like script for Rockchip NPUs on Linux ☆32 · Updated 3 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆239 · Updated 2 years ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆239 · Updated 4 months ago
- Open source repo for AI in a Box. ☆64 · Updated 10 months ago
- Python bindings for ggml ☆137 · Updated 5 months ago
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-… ☆276 · Updated this week
- ☆1,311 · Updated 3 months ago
- On-device LLM Inference Powered by X-Bit Quantization ☆212 · Updated this week
- Open Neural Network Exchange to C compiler. ☆256 · Updated last month
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆476 · Updated last year
- ☆942 · Updated 10 months ago