FR33TR1ST / VoiceAssistant
A VoiceAsistant with WhisperAI speech recognition
☆30Updated 3 months ago
Alternatives and similar repositories for VoiceAssistant:
Users that are interested in VoiceAssistant are comparing it to the libraries listed below
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆90Updated 10 months ago
- A curated list of awesome OpenAI's Whisper☆99Updated last year
- ☆35Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆47Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆205Updated 4 months ago
- Speaker diarization model☆24Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Speaker diarization service☆21Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆191Updated 2 weeks ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆200Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆110Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- ☆153Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 9 months ago
- A streaming whisper server for on-prem transcription☆20Updated 6 months ago
- On-device speaker diarization powered by deep learning☆37Updated 2 weeks ago
- Real time speech to text transcription app.☆399Updated 2 years ago
- Open models for Coqui STT☆129Updated last year
- Live transcription with OpenAi Whisper☆50Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆347Updated last year
- Transcription and diarization (speaker identification)☆31Updated last year
- On-device speaker recognition engine powered by deep learning☆32Updated last week
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- streaming speech to text server using Whisper☆87Updated last year
- FastAPI service on top of WhisperX☆68Updated this week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 10 months ago