Picovoice/koala

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Picovoice/koala)

Picovoice / koala

On-device noise suppression powered by deep learning

☆92

Alternatives and similar repositories for koala

Users that are interested in koala are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Picovoice / octopus
View on GitHub
On-device Speech-to-Index engine powered by deep learning
☆36Apr 16, 2025Updated last year
Picovoice / eagle
View on GitHub
On-device speaker recognition engine powered by deep learning
☆54Updated this week
Picovoice / cobra
View on GitHub
On-device voice activity detection (VAD) powered by deep learning
☆266Updated this week
Picovoice / falcon
View on GitHub
On-device speaker diarization powered by deep learning
☆75Updated this week
Picovoice / speech-to-intent-benchmark
View on GitHub
benchmark for Speech-to-Intent engines
☆18Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Picovoice / leopard
View on GitHub
On-device speech-to-text engine powered by deep learning
☆482Updated this week
Picovoice / browser-extension
View on GitHub
Picovoice Browser Extension
☆17Jun 24, 2026Updated last month
Picovoice / voice-activity-benchmark
View on GitHub
Voice activity engine benchmark framework
☆23Jan 14, 2026Updated 6 months ago
chuck1z / AudioCleaner
View on GitHub
Audio Cleaner using DeepFilterNet, hosted through Streamlit
☆28May 4, 2025Updated last year
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
awasthiabhijeet / Error-Driven-ASR-Personalization
View on GitHub
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11Jun 13, 2021Updated 5 years ago
JaeBinCHA7 / Nested-U-Net-based-Real-time-Speech-Enhancement-Mobile-App
View on GitHub
Real-time speech enhancement mobile app using Nested U-Net
☆55Oct 6, 2023Updated 2 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
Picovoice / orca
View on GitHub
On-device streaming text-to-speech engine powered by deep learning
☆141Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
noisereduce / TorchSpectralGating
View on GitHub
TorchSpectralGate is a PyTorch-based implementation of Spectral Gating, an algorithm for denoising audio signals.
☆27Feb 3, 2024Updated 2 years ago
KVDmitrieva / source_sep_hifi
View on GitHub
☆20Jun 29, 2025Updated last year
timsainb / vocalization-segmentation
View on GitHub
Simple python algorithms for segmenting animal (songbird, mice) vocalizations into notes and syllables using Dynamic Thresholding and Con…
☆27Apr 12, 2021Updated 5 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
Picovoice / picollm
View on GitHub
On-device LLM Inference Powered by X-Bit Quantization
☆315Jul 22, 2026Updated last week
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
Picovoice / rhino
View on GitHub
On-device Speech-to-Intent engine powered by deep learning
☆705Updated this week
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
Kazuhito00 / DIS-ONNX-Sample
View on GitHub
背景除去モデルであるDIS/IS-NetのPythonでのONNX推論サンプル
☆18Feb 25, 2023Updated 3 years ago
seorim0 / SE-using-SRL-Model
View on GitHub
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
☆21Jun 6, 2025Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
Picovoice / speaker-diarization-benchmark
View on GitHub
Speaker diarization benchmark framework
☆42Jul 17, 2026Updated last week
wangfu91 / ten-vad-rs
View on GitHub
A Rust library for working with the TEN VAD (Voice Activity Detection) ONNX model.
☆18Apr 3, 2026Updated 3 months ago
shiguredo / dtln-aec
View on GitHub
An echo cancellation library for browsers using DTLN-aec
☆26Oct 18, 2023Updated 2 years ago
thorstenMueller / cTTS
View on GitHub
TTS Client for Coqui TTS server
☆13Jan 7, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
SSTC-Challenge / SSTC2024_baseline_system
View on GitHub
☆12Jun 14, 2024Updated 2 years ago
porscheinformatik / tapestry-csrf-protection
View on GitHub
Tapestry CSRF Protection
☆11Sep 23, 2025Updated 10 months ago
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
wasertech / DuOS
View on GitHub
My favorite GNU/Linux flavor on the Microsoft Surface Duo.
☆11Feb 7, 2024Updated 2 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago