Kowalski1024 / Mi-Go
Mi-Go is an open-source test framework designed to evaluate and compare the accuracy of speech-to-text models on YouTube dataset.
☆12Updated 6 months ago
Alternatives and similar repositories for Mi-Go:
Users that are interested in Mi-Go are comparing it to the libraries listed below
- ☆9Updated 5 years ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆15Updated last month
- SubER - Subtitle Edit Rate☆22Updated 5 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆17Updated this week
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆20Updated last year
- ☆11Updated last year
- Implementation of Google's USM speech model in Pytorch☆27Updated this week
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 10 months ago
- Metrics for measuring audio quality☆12Updated 5 years ago
- ☆9Updated 3 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 6 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆20Updated 5 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection☆9Updated 4 months ago
- ☆13Updated 2 years ago
- The project for speech translation☆11Updated last year
- ☆8Updated 3 years ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated last month
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆27Updated 6 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆11Updated 2 months ago
- Test Framework for few-shot open set KWS☆25Updated 2 months ago
- Russian phonetical transcription☆9Updated last year
- ☆13Updated 10 months ago
- ☆11Updated 3 months ago
- The aim of this project is to make voice assistants more responsive towards whisper to some extent.☆10Updated 5 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 5 months ago
- ☆16Updated 4 months ago