Repo for Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning
☆15Feb 26, 2022Updated 4 years ago
Alternatives and similar repositories for SemiPPL
Users that are interested in SemiPPL are comparing it to the libraries listed below
Sorting:
- Labeled data for homograph disambiguation☆62Jun 1, 2023Updated 2 years ago
- style token with tacotron2☆62Jul 6, 2023Updated 2 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- Python 汉字到粤拼转换工具。☆35Feb 26, 2024Updated 2 years ago
- Configuration Information for Qt + SGX on TI Platforms☆24Sep 14, 2013Updated 12 years ago
- A sailing robot to map coral reefs☆14Mar 23, 2022Updated 3 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Adaptive Noise Injection based Acoustic Feedback Cancellation☆16Jul 14, 2020Updated 5 years ago
- Build kaldi inside docker containers with option for CUDA support☆12Feb 6, 2017Updated 9 years ago
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 5 years ago
- my personal vim setting☆10Sep 21, 2021Updated 4 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- 基于 g2pW 提升 pypinyin 的准确性☆104Jun 24, 2023Updated 2 years ago
- Code for the paper: A Sketch based 3D Point Cloud Modeling System based on Deep Generation Network and Detail Editing☆11Feb 21, 2022Updated 4 years ago
- ☆10Mar 21, 2018Updated 7 years ago
- recent audio generation papers (including speech, music and general audios)☆13Mar 14, 2023Updated 2 years ago
- Calculate similarity with embedding☆11Jan 22, 2022Updated 4 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- Tsinghua University SPMI Lab array processing toolkit☆18Nov 23, 2016Updated 9 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- ☆14May 9, 2022Updated 3 years ago
- ☆16Feb 19, 2026Updated last week
- chinese_tacotron-2☆12Feb 27, 2018Updated 8 years ago
- Modern audio compression for the internet http://opus-codec.org/: modification for constrained devices☆13Jan 29, 2018Updated 8 years ago
- ☆19Jun 3, 2020Updated 5 years ago
- 利用PaddleSpeech合成原神角色纳西妲声音☆12Dec 6, 2022Updated 3 years ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Jul 5, 2023Updated 2 years ago
- Image style transfer using Convolutional Neural Networks☆13May 1, 2017Updated 8 years ago
- U-Net + Attention, extending U-Net model for semantic segmentation. Implemented with TensorFlow.☆11May 11, 2019Updated 6 years ago
- Implementation of BAM: Bottleneck Attention Module with TensorFLow☆12Jan 16, 2019Updated 7 years ago
- Solution by Nhi Vo for AICovidVN 115M Challenge: Covid Cough Detection Challenge☆10Jul 8, 2021Updated 4 years ago