FR33TR1ST / VoiceAssistant
A VoiceAssistant with WhisperAI speech recognition
☆29 · Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for VoiceAssistant
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 · Updated 6 months ago
- Live transcription with OpenAI Whisper ☆50 · Updated 2 years ago
- A quick experiment to achieve almost realtime transcription using Whisper. ☆186 · Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆105 · Updated last year
- Zero-shot Audio Classification using Whisper ☆74 · Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆68 · Updated 6 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆133 · Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆44 · Updated last year
- streaming speech to text server using Whisper ☆83 · Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆196 · Updated 3 weeks ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆141 · Updated 6 months ago
- Shared Voice Interface ☆40 · Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆275 · Updated last week
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆65 · Updated 2 years ago
- A curated list of awesome OpenAI's Whisper ☆93 · Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper ☆37 · Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆53 · Updated 10 months ago
- Transcription and Diarization based on OpenAI's Whisper ☆19 · Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆158 · Updated last month
- Create an LJSpeech structured voice dataset on wave input ☆21 · Updated last month
- whisper.cpp bindings for python ☆77 · Updated last year
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API. ☆49 · Updated last year
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization ☆25 · Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆75 · Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆114 · Updated last year
- [WIP] VoiceSmith makes training text to speech models easy. ☆222 · Updated 2 years ago
- ☆32 · Updated last year
- On-device speaker recognition engine powered by deep learning ☆27 · Updated this week
- Speaker prediction for captions on the Lex Fridman podcast ☆23 · Updated 9 months ago
- Transcription and diarization (speaker identification) ☆28 · Updated last year