FR33TR1ST / VoiceAssistant
A VoiceAssistant with WhisperAI speech recognition
☆29Updated 2 months ago
Alternatives and similar repositories for VoiceAssistant:
Users who are interested in VoiceAssistant are comparing it to the libraries listed below.
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Live transcription with OpenAI Whisper☆50Updated 2 years ago
- Shared Voice Interface☆41Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Transcription and diarization (speaker identification)☆30Updated last year
- Create an LJSpeech structured voice dataset on wave input☆24Updated 4 months ago
- Accelerate Whisper tasks such as transcription through multiprocessing☆25Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆203Updated 3 months ago
- ☆35Updated 2 years ago
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google.☆59Updated last year
- A curated list of awesome OpenAI Whisper resources☆97Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆45Updated 2 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆138Updated 8 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆298Updated 2 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 8 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆187Updated 2 years ago
- Streaming speech-to-text server using Whisper☆85Updated last year
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- On-device speaker diarization powered by deep learning☆34Updated 2 weeks ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆37Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model.☆339Updated 11 months ago
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 6 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆231Updated 7 months ago
- Your one-stop solution for voice dataset creation☆117Updated last year
- Code for OpenAI Whisper Web App Demo☆94Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year