Picovoice / koala
On-device noise suppression powered by deep learning
☆72 · Updated last week
Alternatives and similar repositories for koala
Users interested in koala are comparing it to the libraries listed below.
- On-device voice activity detection (VAD) powered by deep learning ☆218 · Updated this week
- On-device speaker diarization powered by deep learning ☆50 · Updated last week
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 · Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 · Updated 3 weeks ago
- C++ version of pyannote audio speaker diarization pipeline ☆21 · Updated last year
- ONNX Inference of Pyannote Segmentation ☆90 · Updated 6 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆84 · Updated last year
- C++ library for converting text to phonemes for Piper ☆121 · Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆115 · Updated 2 years ago
- A curated list of awesome voice activity detection ☆57 · Updated 7 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆98 · Updated 8 months ago
- Create an LJSpeech structured voice dataset on wave input ☆30 · Updated 8 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆96 · Updated this week
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆137 · Updated 3 months ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆29 · Updated last year
- Real-time speech enhancement mobile app using Nested U-Net ☆51 · Updated last year
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆43 · Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆94 · Updated 5 months ago
- Python bindings of speexdsp noise suppression library ☆39 · Updated 2 years ago
- On-device Speech-to-Index engine powered by deep learning ☆36 · Updated 2 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆78 · Updated 10 months ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Updated last year
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi ☆67 · Updated 3 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 · Updated last year
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project. ☆70 · Updated 2 years ago
- Tools for making LJSpeech datasets ☆25 · Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages ☆27 · Updated 2 months ago