yeyupiaoling / YeAudio
Python的音频工具
☆12Updated 3 months ago
Alternatives and similar repositories for YeAudio:
Users that are interested in YeAudio are comparing it to the libraries listed below
- Python Wrapper of Silero VAD☆47Updated last month
- Huawei Grad-TTS for Chinese☆46Updated last year
- paraformer(chinense asr) online onnx runtime for python☆40Updated 10 months ago
- ☆65Updated last year
- Chinese and English Bilinguish G2P☆20Updated last year
- Streaming Text to Speech Web UI☆15Updated 9 months ago
- noise reduction☆17Updated 7 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆85Updated 2 weeks ago
- CTC decoder with hotwords for ASR.☆16Updated last month
- ☆16Updated 3 months ago
- g2p for english tts☆18Updated 2 years ago
- 单独维护的中文TTS☆35Updated 2 years ago
- ☆18Updated 4 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆18Updated 11 months ago
- ☆31Updated 3 years ago
- ☆24Updated this week
- ☆37Updated 3 years ago
- Utilizes ONNX Runtime for audio denoising.☆32Updated last week
- Colab notebooks for Next-gen Kaldi☆26Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆76Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 6 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated 10 months ago
- Went online decode demo☆29Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆16Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- Project of Singing Voice Conversion.☆14Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆60Updated last month
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- (WIP)long form speech generatoins☆30Updated 2 months ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆16Updated last month