pengzhendong / pyrnnoise

Python Wrapper for RNNoise v0.2

☆42

Alternatives and similar repositories for pyrnnoise

Users who are interested in pyrnnoise are comparing it to the libraries listed below.

pengzhendong / pysilero
Python Wrapper of Silero VAD
☆57Updated 2 months ago
DakeQQ / Audio-Denoiser-ONNX
Utilizes ONNX Runtime for audio denoising.
☆60Updated 2 weeks ago
pengzhendong / g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆105Updated 4 months ago
csukuangfj / kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
☆112Updated 2 weeks ago
yuyun2000 / SpeechDenoiser
SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…
☆82Updated 11 months ago
ScottishFold007 / TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆101Updated 7 months ago
linan2 / Voice-activity-detection-VAD-paper-and-code
Voice activity detection (VAD) papers and code (from 198*~) and their classification.
☆101Updated last month
jiay7 / wenet_onlinedecode
WeNet online decode demo
☆30Updated 4 years ago
pirxus / personalVAD
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
☆70Updated 2 years ago
MaxMax2016 / Grad-TTS-Chinese
Huawei Grad-TTS for Chinese
☆50Updated last year
R1ckShi / SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆32Updated last year
lovemefan / fsmn-vad
An enterprise-grade Voice Activity Detector from modelscope and funasr.
☆105Updated 2 years ago
k2-fsa / colab
Colab notebooks for Next-gen Kaldi
☆28Updated 3 months ago
wxqwinner / silero-vad-ncnn
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆20Updated 11 months ago
pengzhendong / asr-decoder
CTC decoder with hotwords for ASR.
☆20Updated 3 months ago
DaiYvhang / AISHELL-5
In-car multi-channel speech transcription system of AISHELL-5.
☆30Updated last month
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆183Updated last week
aizhiqi-work / MM-KWS
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆35Updated 2 months ago
dukGuo / valle-audiodec
Inference code for Audiodec-Valle-Wenetspeech4TTS
☆50Updated last year
wenet-e2e / wesignal
Production-first, NN-based on-device signal processing toolkit.
☆64Updated 2 years ago
Jackiexiao / tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆99Updated last year
frankyoujian / Edge-Punct-Casing
☆29Updated 5 months ago
tomasJwYU / AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
☆31Updated last year
Kevin-naticl / LLaSE-G1
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆79Updated 4 months ago
pengzhendong / welm
One command to build TLG.fst for WeNet.
☆31Updated 2 years ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation in your audio files
☆155Updated last month
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆203Updated last month
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆29Updated 4 months ago
Xiaobin-Rong / deepvqe
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
☆95Updated 4 months ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM
☆41Updated 2 years ago