bbookman / Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool
Takes audio and reference transcriptions in bulk and generates WER
☆13Updated 3 years ago
Alternatives and similar repositories for Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool:
Users that are interested in Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool are comparing it to the libraries listed below
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- ☆25Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- ☆39Updated last year
- ☆11Updated 3 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated 3 weeks ago
- This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"☆10Updated last year
- ☆33Updated 3 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16Updated 2 years ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 5 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- ☆18Updated 2 years ago
- Sorce code of Apkinson: android app to monitor the motor symptoms of Parkinson's patients☆17Updated 4 years ago
- ☆17Updated 3 years ago
- ☆11Updated 3 years ago
- This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …☆10Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Pytorch Models for Speech Enhancement☆19Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 5 months ago
- Wenet speech to text for react native☆10Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆20Updated 3 weeks ago
- ☆14Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated last month
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago