bbookman / Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool

Takes audio and reference transcriptions in bulk and generates WER

☆13

Alternatives and similar repositories for Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool:

Users that are interested in Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool are comparing it to the libraries listed below

HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆48Updated 9 months ago
Open-Speech-EkStep / crowdsource-dataplatform
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17Updated 2 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆12Updated last year
MarceloSancinetti / epa-gop-pykaldi
☆25Updated 2 years ago
JazminVidal / gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
☆15Updated 2 years ago
pyannote / AMI-diarization-setup
☆39Updated last year
yuhangear / wenet-android
☆11Updated 3 years ago
NTIA / alignnet
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆17Updated 3 weeks ago
mnabihali / Joint-training-embeddings
This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"
☆10Updated last year
daanzu / wenet_stt_python
☆33Updated 3 years ago
JuanFMontesinos / Acappella-YNet
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆16Updated 2 years ago
TeaPoly / speexdsp-ns-python
Python bindings of speexdsp noise suppression library
☆38Updated 2 years ago
xuchenglin28 / speech_separation
Constrained Permutation Invariant Training, Speech Separation
☆47Updated 4 years ago
jadfegh / audiovision
Real-time Speech Separation, Noise Suppression & Speaker Recognition
☆18Updated 5 years ago
projecte-aina / oTranscribe-plus
A free & open tool for transcribing audio interviews with offline ASR support
☆24Updated last year
BirgerMoell / tmh
☆18Updated 2 years ago
SAGI-FAU / SMA2
Sorce code of Apkinson: android app to monitor the motor symptoms of Parkinson's patients
☆17Updated 4 years ago
Open-Speech-EkStep / data-acquisition-pipeline
☆17Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11Updated 3 years ago
furkanarius / Multichannel-Speech-Enhancement-with-Deep-Neural-Networks
This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …
☆10Updated 2 years ago
georgesterpu / Taris
Transformer-based online speech recognition system with TensorFlow 2
☆26Updated 4 years ago
ooshyun / Speech-Enhancement-Pytorch
Pytorch Models for Speech Enhancement
☆19Updated 2 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14Updated 5 months ago
Hannes1 / react-native-wenet
Wenet speech to text for react native
☆10Updated 2 years ago
Lhx94As / E2E-language-diarization
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆18Updated 3 years ago
kgnlp / allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆20Updated 3 weeks ago
yucongzh / online_speaker_diarization
☆14Updated 2 years ago
fgnt / speaker_reassignment
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆12Updated last month
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16Updated 2 years ago