thamquocdung / eCMU
eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)
☆9Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for eCMU
- ☆10Updated last year
- VietTTS: An Open-Source Vietnamese Text to Speech☆21Updated 3 weeks ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆13Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆13Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆9Updated 9 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- ☆11Updated 3 years ago
- ☆17Updated last year
- ☆11Updated last year
- ☆16Updated 2 years ago
- ClearVoice☆13Updated this week
- Wenet speech to text for react native☆10Updated 2 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆19Updated last year
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated last year
- ☆16Updated 3 years ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆18Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆21Updated 2 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated 10 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆20Updated 4 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆26Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- ☆18Updated 8 months ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆13Updated 2 years ago
- ☆41Updated 2 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆12Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 8 months ago
- ☆12Updated last year
- Megatts2 use HierSpeechpp's vocoder☆15Updated last week
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆17Updated 3 months ago