On-device noise suppression powered by deep learning
☆83Feb 28, 2026Updated this week
Alternatives and similar repositories for koala
Users who are interested in koala are comparing it to the libraries listed below
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- On-device voice activity detection (VAD) powered by deep learning☆245Updated this week
- On-device speaker diarization powered by deep learning☆69Updated this week
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- On-device speaker recognition engine powered by deep learning☆41Updated this week
- Real-time speech enhancement mobile app using Nested U-Net☆55Oct 6, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Audio Cleaner using DeepFilterNet, hosted through Streamlit☆25May 4, 2025Updated 10 months ago
- **ICASSP 2022** "Toward Degradation-Robust Voice Conversion": Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- simple to use, pretrained/training-less models for speaker diarization☆21Aug 23, 2023Updated 2 years ago
- TorchSpectralGate is a PyTorch-based implementation of Spectral Gating, an algorithm for denoising audio signals.☆27Feb 3, 2024Updated 2 years ago
- BioVoice: a multipurpose tool for voice analysis☆11Nov 13, 2020Updated 5 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year