dengcunqin / noise-reductionLinks
noise reduction
☆17Updated last year
Alternatives and similar repositories for noise-reduction
Users that are interested in noise-reduction are comparing it to the libraries listed below
Sorting:
- (WIP)long form speech generatoins☆31Updated 8 months ago
- faster inference☆28Updated 11 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆105Updated last year
- ☆23Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Updated 11 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Updated last year
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆73Updated 6 months ago
- Python Wrapper of Silero VAD☆63Updated 7 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆38Updated 3 weeks ago
- ☆29Updated 10 months ago
- ☆68Updated 2 years ago
- silero-vad pytorch implement☆34Updated last year
- a lightweight voice conversion☆85Updated last year
- CTC decoder with hotwords for ASR.☆34Updated 8 months ago
- semantic tokenizer for speech and music☆21Updated 5 months ago
- ☆23Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆60Updated 3 months ago
- Huawei Grad-TTS for Chinese☆50Updated 2 years ago
- ☆36Updated 3 months ago
- In-car multi-channel speech transcription system of AISHELL-5.☆36Updated 6 months ago
- ☆103Updated 2 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆76Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Updated 3 months ago
- Streaming Text to Speech Web UI☆22Updated last year
- ☆20Updated 4 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Updated 3 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆112Updated 3 weeks ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆106Updated 7 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Updated last year