maxbbraun / whisper-edge
OpenAI Whisper for edge devices
☆124Updated 2 years ago
Alternatives and similar repositories for whisper-edge:
Users that are interested in whisper-edge are comparing it to the libraries listed below
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- ONNX implementation of Whisper. PyTorch free.☆92Updated 4 months ago
- streaming speech to text server using Whisper☆89Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- whisper.cpp bindings for python☆93Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- On-device streaming text-to-speech engine powered by deep learning☆73Updated last week
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆203Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆203Updated this week
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆76Updated 5 months ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- OneShot Learning-based hotword detection.☆252Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆480Updated last year
- Efficient Inference of Transformer models☆427Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆316Updated 4 months ago
- On-device noise suppression powered by deep learning☆69Updated last week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 4 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆376Updated 7 months ago
- openvino version of openai/whisper☆166Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆62Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆199Updated last month
- A quick experiment to achieve almost realtime transcription using Whisper.☆187Updated 2 years ago
- ☆352Updated last year
- Pybind11 bindings for Whisper.cpp☆328Updated 3 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆239Updated 2 years ago
- NVIDIA Riva runnable tutorials☆127Updated last week
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆204Updated 8 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year