bbookman / Google-Speech-to-Text-API-Word-Error-Rate-Analysis-Tool
Takes audio and reference transcriptions in bulk and generates WER
☆13Updated 3 years ago
Related projects: ⓘ
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Sorce code of Apkinson: android app to monitor the motor symptoms of Parkinson's patients☆17Updated 4 years ago
- ☆30Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆64Updated 11 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- ☆16Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆21Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆13Updated 3 months ago
- Speaker diarization benchmark framework☆10Updated 9 months ago
- ☆25Updated 2 years ago
- ☆56Updated this week
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- Pytorch Models for Speech Enhancement☆15Updated last year
- ☆10Updated 11 months ago
- Using speaker embedding for diarization in PyTorch☆16Updated 4 years ago
- NSNet2 Deep Noise Suppression (DNS) package☆29Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 9 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Real-time speech enhancement mobile app using Nested U-Net☆38Updated 11 months ago
- ☆11Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆18Updated last year
- ☆57Updated 2 weeks ago
- ☆27Updated 5 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆79Updated 5 months ago
- ☆17Updated last year