Picovoice / koalaLinks
On-device noise suppression powered by deep learning
☆73Updated 3 weeks ago
Alternatives and similar repositories for koala
Users that are interested in koala are comparing it to the libraries listed below
Sorting:
- On-device speaker diarization powered by deep learning☆51Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- A curated list of awesome voice activity detection☆59Updated 7 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- C++ library for converting text to phonemes for Piper☆128Updated last year
- Uses machine learning to denoise audio containing speech☆35Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- Python bindings of speexdsp noise suppression library☆39Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- An even smaller speech recognizer / force aligner☆34Updated 6 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆37Updated this week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆104Updated 5 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆79Updated 10 months ago
- Your one-stop solution for voice dataset creation☆120Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆62Updated 3 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated 7 months ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆52Updated 2 weeks ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated last year
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago