pengzhendong / pyrnnoiseLinks
Python Wrapper for RnNoise v0.2
☆54Updated last month
Alternatives and similar repositories for pyrnnoise
Users that are interested in pyrnnoise are comparing it to the libraries listed below
Sorting:
- Python Wrapper of Silero VAD☆59Updated 4 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆86Updated last year
- Target Speaker Extraction Toolkit☆196Updated last month
- ☆29Updated 7 months ago
- CTC decoder with hotwords for ASR.☆23Updated 5 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆74Updated 2 years ago
- Utilizes ONNX Runtime for audio denoising.☆76Updated 2 weeks ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆107Updated 5 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆112Updated 2 years ago
- Huawei Grad-TTS for Chinese☆51Updated last year
- ☆82Updated 2 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆156Updated last month
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆104Updated 8 months ago
- Official Repository For VoxBlink2☆81Updated last year
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- silero-vad pytorch implement☆26Updated 9 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆141Updated this week
- Colab notebooks for Next-gen Kaldi☆28Updated 2 weeks ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆82Updated 5 months ago
- An LLM base TTS engine☆89Updated 8 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆94Updated 8 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆84Updated 5 months ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆21Updated last year
- Went online decode demo☆31Updated 4 years ago
- ☆66Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆35Updated 4 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆171Updated 2 weeks ago
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- In-car multi-channel speech transcription system of AISHELL-5.☆33Updated 3 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆83Updated this week