MagicHub-io / MagicData-RAMC
MagicData-RAMC Dataset and Baseline
☆49Updated 2 years ago
Related projects: ⓘ
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆42Updated 9 months ago
- ☆32Updated last month
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆15Updated 4 years ago
- Target Speaker Extraction Toolkit☆58Updated 2 weeks ago
- ☆31Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆26Updated last year
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆28Updated 8 months ago
- Speech samples and code of BEdit-TTS☆32Updated 11 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆76Updated last year
- SpEx+(tied) source code☆72Updated last year
- ☆26Updated last year
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆40Updated last week
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆63Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆53Updated 3 years ago
- ☆39Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- A simple package for Guided source separation (GSS)☆104Updated 4 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 2 years ago
- Chinese Text Normalization and Dataset☆78Updated 2 years ago
- ☆48Updated 11 months ago
- Training data simulation☆38Updated 4 months ago
- ☆32Updated 2 years ago
- ☆13Updated last year
- Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.☆17Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆37Updated last year
- ☆63Updated this week
- ☆29Updated 2 years ago
- multilingual speech aligner☆70Updated 10 months ago