yeyupiaoling / YeAudio
Audio tools for Python
☆12Updated 4 months ago
Alternatives and similar repositories for YeAudio:
Users who are interested in YeAudio are comparing it to the libraries listed below.
- Paraformer (Chinese ASR) online ONNX Runtime for Python☆41Updated last year
- Separately maintained Chinese TTS☆35Updated 2 years ago
- Python Wrapper of Silero VAD☆48Updated 3 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆19Updated last year
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 3 months ago
- ☆20Updated 5 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Utilizes ONNX Runtime for audio denoising.☆40Updated last month
- CTC decoder with hotwords for ASR.☆17Updated 2 months ago
- Streaming Text to Speech Web UI☆16Updated 10 months ago
- Huawei Grad-TTS for Chinese☆46Updated last year
- Simple inference for VITS2 TTS using ONNX Runtime and espeak-ng in C++☆16Updated 11 months ago
- ☆26Updated last month
- noise reduction☆17Updated 8 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆71Updated 7 months ago
- ☆37Updated 3 years ago
- silero-vad PyTorch implementation☆17Updated 4 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆64Updated 4 months ago
- Chinese and English bilingual G2P☆20Updated last year
- singing voice conversion without f0☆23Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆89Updated last week
- Project of Singing Voice Conversion.☆14Updated last year
- (WIP) Long-form speech generation☆30Updated 3 months ago
- ☆31Updated 3 years ago
- G2P for English TTS☆19Updated 2 years ago
- Utilizes ONNX Runtime for speech activity detection.☆18Updated 2 months ago
- faster inference☆27Updated 2 months ago
- WeNet online decode demo☆29Updated 3 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated 11 months ago
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year