FR33TR1ST / VoiceAssistantLinks
A VoiceAsistant with WhisperAI speech recognition
☆32Updated 8 months ago
Alternatives and similar repositories for VoiceAssistant
Users that are interested in VoiceAssistant are comparing it to the libraries listed below
Sorting:
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Real time speech to text transcription app.☆420Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆335Updated 9 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆363Updated last year
- Live transcription with OpenAi Whisper☆50Updated 2 years ago
- Shared Voice Interface☆43Updated last year
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆355Updated 3 weeks ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆215Updated 9 months ago
- Speaker diarization model☆28Updated 2 years ago
- ☆38Updated 2 years ago
- Open models for Coqui STT☆141Updated 2 years ago
- Batch Support for OpenAI Whisper☆94Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆79Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- Transcription and diarization (speaker identification)☆33Updated 2 years ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆171Updated 2 years ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆188Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆90Updated last year
- On-device speaker diarization powered by deep learning☆52Updated this week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆250Updated last year
- Streaming transcriber with whisper☆690Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆224Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- 🐸STT integration examples☆130Updated 2 years ago