Picovoice / koalaLinks
On-device noise suppression powered by deep learning
☆75Updated 2 months ago
Alternatives and similar repositories for koala
Users that are interested in koala are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆231Updated last month
- On-device speaker diarization powered by deep learning☆56Updated 2 months ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- ONNX Inference of Pyannote Segmentation☆94Updated 10 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 3 years ago
- C++ library for converting text to phonemes for Piper☆134Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆54Updated 5 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated 11 months ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last month
- Uses machine learning to denoise audio containing speech☆43Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆102Updated 2 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆175Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆32Updated 5 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- ☆275Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆124Updated 10 months ago