Picovoice / koalaLinks
On-device noise suppression powered by deep learning
☆73Updated 2 weeks ago
Alternatives and similar repositories for koala
Users that are interested in koala are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆222Updated last week
- On-device speaker diarization powered by deep learning☆51Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation☆92Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆174Updated last year
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆65Updated last week
- C++ library for converting text to phonemes for Piper☆128Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆106Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Create an LJSpeech structured voice dataset on wave input☆33Updated 10 months ago
- ☆40Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆54Updated last week
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆102Updated last week
- Tunable pipelines☆35Updated 5 months ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆222Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆83Updated 11 months ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆30Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Speaker diarization service☆23Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Uses machine learning to denoise audio containing speech☆36Updated last year
- VoiceBox neural network implementation☆108Updated last year
- An automatic speech recognition API☆65Updated 2 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year