maxbbraun / whisper-edge
OpenAI Whisper for edge devices
☆125 · Updated 2 years ago
Alternatives and similar repositories for whisper-edge
Users who are interested in whisper-edge are comparing it to the libraries listed below.
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆118 · Updated last year
- openvino version of openai/whisper ☆165 · Updated last year
- ONNX implementation of Whisper. PyTorch free. ☆96 · Updated 5 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆244 · Updated 2 years ago
- NVIDIA Riva runnable tutorials ☆130 · Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 · Updated last year
- streaming speech to text server using Whisper ☆92 · Updated last year
- Pybind11 bindings for Whisper.cpp ☆331 · Updated 5 months ago
- whisper.cpp bindings for python ☆95 · Updated last year
- Pybind11 bindings for Whisper.cpp ☆57 · Updated 2 weeks ago
- A quick experiment to achieve almost realtime transcription using Whisper. ☆187 · Updated 2 years ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT ☆206 · Updated last year
- Python bindings for whisper.cpp ☆247 · Updated last week
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆81 · Updated 7 months ago
- Efficient Inference of Transformer models ☆432 · Updated 9 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆81 · Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 · Updated last year
- A voice assistant with WhisperAI speech recognition ☆30 · Updated 5 months ago
- Audio Keyphrase Detector ☆148 · Updated 3 years ago
- Joint speech-language model - respond directly to audio! ☆369 · Updated 10 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆210 · Updated 6 months ago
- Zero-shot Audio Classification using Whisper ☆80 · Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆496 · Updated last year
- Streaming TTS based on Piper with optional RK3588 NPU support ☆86 · Updated 3 weeks ago
- OneShot Learning-based hotword detection. ☆259 · Updated 8 months ago
- Streaming transcriber with whisper ☆686 · Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆214 · Updated last week
- A ggml (C++) re-implementation of tortoise-tts ☆182 · Updated 8 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 · Updated last year