pengzhendong / pyrnnoise
Python Wrapper for RnNoise v0.2
☆63Updated 3 weeks ago
Alternatives and similar repositories for pyrnnoise
Users who are interested in pyrnnoise are comparing it to the libraries listed below.
- Python Wrapper of Silero VAD☆61Updated 6 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆210Updated last week
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆110Updated 8 months ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆119Updated 2 years ago
- Target Speaker Extraction Toolkit☆218Updated last month
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆107Updated 11 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆100Updated last year
- We Speech Toolkit, an LLM-based Speech Toolkit for Speech Understanding, Generation, and Interaction☆151Updated last week
- CTC decoder with hotwords for ASR.☆34Updated 7 months ago
- Utilizes ONNX Runtime for audio denoising.☆92Updated last week
- ONNX Inference of Pyannote Segmentation☆95Updated 10 months ago
- IndexTTS Fine-tuning notebooks☆116Updated 5 months ago
- ☆139Updated 2 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆96Updated 7 months ago
- An LLM-based TTS engine☆91Updated 10 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆176Updated 2 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆77Updated 3 years ago
- Chinese and English Bilingual G2P☆21Updated 2 years ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆88Updated last month
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆89Updated 7 months ago
- ☆104Updated 2 months ago
- Text-audio foundation model from Boson AI☆112Updated 2 months ago
- ☆29Updated 9 months ago
- Train the next generation of TTS systems.☆169Updated last year
- Huawei Grad-TTS for Chinese☆49Updated 2 years ago
- noise reduction☆17Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆180Updated last year
- This is the audio sample repository for speech separation model "MossFormer2".☆153Updated 11 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆216Updated 8 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆127Updated 5 months ago