IS2AI / MultilingualASR
☆12 Updated 3 years ago
Alternatives and similar repositories for MultilingualASR:
Users who are interested in MultilingualASR are comparing it to the libraries listed below.
- ☆11 Updated 3 years ago
- ☆11 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Normalize Text in Russian ☆26 Updated last year
- Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi. ☆26 Updated 6 months ago
- ☆13 Updated 3 years ago
- Russian phonetical transcription ☆9 Updated last year
- ☆17 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆12 Updated last year
- ☆12 Updated 3 weeks ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated 11 months ago
- Punctuation and casing restoration for the Russian Language (BERT-based) ☆20 Updated 3 years ago
- ☆56 Updated 2 years ago
- T5-based (Russian) text normalization ☆20 Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- Training BERT for punctuation task ☆10 Updated 4 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks. ☆33 Updated last year
- ☆21 Updated 2 weeks ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆21 Updated this week
- ☆9 Updated 5 years ago
- ☆8 Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ☆12 Updated 3 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆19 Updated 3 months ago
- ☆13 Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆15 Updated 4 months ago
- ☆9 Updated 4 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆17 Updated 2 years ago
- ☆24 Updated 4 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 Updated 5 months ago
- WarpRNNT loss ported to Numba CPU/CUDA for PyTorch ☆16 Updated 2 years ago