cnlinxi / blog
personal blog
☆14Updated 2 years ago
Alternatives and similar repositories for blog:
Users that are interested in blog are comparing it to the libraries listed below
- ☆74Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- Predict prosody labels for Chinese sentences.☆40Updated 2 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆75Updated last week
- ☆69Updated 4 years ago
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆95Updated this week
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆65Updated 5 months ago
- Target Speaker Extraction Toolkit☆139Updated 2 months ago
- ☆94Updated last year
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆48Updated 2 years ago
- Voice activity detection (VAD) paper and code(From 198*~ )and its classification.☆90Updated 11 months ago
- A training code template for DNN-based speech enhancement.☆63Updated last week
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆78Updated last year
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆52Updated 4 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆47Updated last year
- ☆64Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆92Updated 11 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆183Updated 9 months ago
- Implementation of StyleTTS for Mandarin☆11Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆31Updated last year
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆99Updated 2 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆64Updated 2 weeks ago
- UT-Sarulab MOS prediction system using SSL models☆199Updated 9 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆114Updated 11 months ago
- ☆114Updated 3 weeks ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆82Updated 2 years ago
- ☆135Updated 11 months ago
- multi-scale time domain speaker extraction☆60Updated 3 years ago
- ☆65Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆48Updated 2 weeks ago