Picovoice / koala
On-device noise suppression powered by deep learning
☆64Updated this week
Alternatives and similar repositories for koala:
Users that are interested in koala are comparing it to the libraries listed below
- On-device speaker diarization powered by deep learning☆33Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆190Updated this week
- C++ library for converting text to phonemes for Piper☆99Updated 10 months ago
- Create an LJSpeech structured voice dataset on wave input☆23Updated 3 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- On-device streaming text-to-speech engine powered by deep learning☆62Updated this week
- C++ version of pyannote audio speaker diarizaiton pipeline☆19Updated 11 months ago
- An automatic speech recognition API☆48Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- On-device speaker recognition engine powered by deep learning☆30Updated this week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆158Updated 10 months ago
- ☆104Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last month
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆144Updated 8 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆57Updated 5 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆68Updated last week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆75Updated last year
- On-device Speech-to-Index engine powered by deep learning☆36Updated last month
- Real-time speech enhancement mobile app using Nested U-Net☆45Updated last year
- ☆38Updated 11 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆92Updated this week
- ONNX Inference of Pyannote Segmentation☆81Updated 3 weeks ago
- Your one-stop solution for voice dataset creation☆117Updated last year
- Speaker diarization service☆20Updated 3 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 6 months ago
- benchmark for Speech-to-Intent engines☆15Updated 7 months ago
- ☆27Updated last week
- Simple Diarization model☆46Updated last year