Kowalski1024 / Mi-GoLinks
Mi-Go is an open-source test framework designed to evaluate and compare the accuracy of speech-to-text models on YouTube dataset.
☆12Updated last year
Alternatives and similar repositories for Mi-Go
Users that are interested in Mi-Go are comparing it to the libraries listed below
Sorting:
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation"☆11Updated last month
- ☆11Updated last year
- ☆9Updated 5 years ago
- MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation☆13Updated 3 months ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- ☆12Updated 5 months ago
- DysfluentWFST☆13Updated last month
- Evaluation of STT models for german language☆15Updated 3 years ago
- An extension of PHOIBLE that includes features for allophones.☆10Updated 2 years ago
- MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection☆9Updated 9 months ago
- SubER - Subtitle Edit Rate☆22Updated 2 months ago
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆14Updated last year
- Survey of available speech datasets for Polish ASR development☆16Updated 6 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆16Updated last week
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆14Updated 7 months ago
- PolEval 2021 Task 1☆15Updated 3 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆14Updated 11 months ago
- Russian phonetical transcription☆10Updated last year
- The project for speech translation☆11Updated last year
- ☆11Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆11Updated last month
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Updated 4 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 8 months ago
- ☆10Updated 2 years ago
- ☆15Updated 2 months ago