Picovoice / falcon
On-device speaker diarization powered by deep learning
☆34Updated 2 weeks ago
Alternatives and similar repositories for falcon:
Users that are interested in falcon are comparing it to the libraries listed below
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆65Updated 2 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆74Updated 2 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last week
- ONNX Inference of Pyannote Segmentation☆81Updated last month
- ☆21Updated 5 months ago
- ☆33Updated 3 weeks ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆109Updated last week
- a lightweight voice conversion☆78Updated 4 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆78Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆16Updated 11 months ago
- ☆31Updated 9 months ago
- ☆57Updated 11 months ago
- ☆62Updated 8 months ago
- An automatic speech recognition API☆48Updated this week
- ☆19Updated last year
- OpenAI Whisper Prompt Examples☆50Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆94Updated 2 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Colab notebooks for Next-gen Kaldi☆26Updated last month
- Unofficial implementation of wavenext vocoder☆40Updated 5 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆65Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆47Updated this week
- Create an LJSpeech structured voice dataset on wave input☆24Updated 4 months ago
- ☆38Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- Onnx wrapper for espnet infrernce model☆159Updated 3 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis☆126Updated 2 weeks ago