bbookman / Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool
Takes audio and reference transcriptions in bulk and generates WER
☆13Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool
- ☆34Updated 10 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆71Updated last year
- Sorce code of Apkinson: android app to monitor the motor symptoms of Parkinson's patients☆17Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- A pipeline to isolate and transcribe one language in mixed-language speech☆18Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- ☆25Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- Tunable pipelines☆30Updated last month
- ☆50Updated last year
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- ☆11Updated 9 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Updated 2 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- Repository for Accent Recognition (Hackathon @SLT2022)☆23Updated 6 months ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆14Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- MeetEval - A meeting transcription evaluation toolkit☆78Updated 3 weeks ago
- ☆28Updated 2 years ago
- OpenAI Whisper Prompt Examples☆48Updated last year
- Speaker diarization service☆19Updated this week
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆47Updated last year