Picovoice / koala
On-device noise suppression powered by deep learning
☆69 Updated 3 weeks ago
Alternatives and similar repositories for koala:
Users who are interested in koala are comparing it to the libraries listed below.
- On-device speaker diarization powered by deep learning ☆44 Updated last month
- On-device voice activity detection (VAD) powered by deep learning ☆208 Updated this week
- A curated list of awesome voice activity detection ☆50 Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 6 months ago
- On-device speaker recognition engine powered by deep learning ☆34 Updated this week
- C++ version of pyannote audio speaker diarization pipeline ☆21 Updated last year
- Uses machine learning to denoise audio containing speech ☆33 Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆83 Updated last year
- Zero-shot Audio Classification using Whisper ☆80 Updated 2 years ago
- ONNX Inference of Pyannote Segmentation ☆86 Updated 4 months ago
- ☆39 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆112 Updated 2 years ago
- ☆62 Updated 2 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆73 Updated 8 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆50 Updated 10 months ago
- Simple PyTorch Denoisers for Waveform Audio ☆35 Updated last week
- On-device Speech-to-Index engine powered by deep learning ☆36 Updated 2 weeks ago
- StyleTTS 2 Optimized Training Fork ☆27 Updated 3 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆148 Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆14 Updated last week
- On-device streaming text-to-speech engine powered by deep learning ☆77 Updated this week
- Colab notebooks for Next-gen Kaldi ☆27 Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 Updated 2 months ago
- VoiceBox neural network implementation ☆106 Updated 9 months ago
- ☆34 Updated 2 weeks ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆24 Updated last year
- Simple Diarization model ☆47 Updated last year
- C++ library for converting text to phonemes for Piper ☆117 Updated last year