Picovoice / koalaLinks
On-device noise suppression powered by deep learning
☆78Updated last week
Alternatives and similar repositories for koala
Users that are interested in koala are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆238Updated last week
- On-device speaker diarization powered by deep learning☆61Updated last week
- A curated list of awesome voice activity detection☆70Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 2 months ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆22Updated last year
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆108Updated 2 weeks ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆67Updated 3 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆29Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- An automatic speech recognition API☆76Updated last month
- openvino version of openai/whisper☆178Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Uses machine learning to denoise audio containing speech☆48Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆135Updated 2 months ago
- Open models for Coqui STT☆148Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆137Updated 5 months ago
- Python bindings of speexdsp noise suppression library☆45Updated 3 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated last year
- 🐸STT integration examples☆129Updated 3 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 7 months ago
- ☆275Updated last year