Picovoice / falcon
On-device speaker diarization powered by deep learning
☆57, updated this week
Alternatives and similar repositories for falcon
Users who are interested in falcon are comparing it to the libraries listed below.
- On-device noise suppression powered by deep learning ☆76, updated this week
- On-device voice activity detection (VAD) powered by deep learning ☆233, updated last week
- A curated list of awesome voice activity detection ☆68, updated last year
- ☆43, updated last year
- On-device streaming text-to-speech engine powered by deep learning ☆122, updated last week
- ONNX Inference of Pyannote Segmentation ☆95, updated 11 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153, updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97, updated last year
- Very fast, accurate speaker diarization ☆166, updated last week
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆96, updated 7 months ago
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events ☆85, updated last year
- C++ library for converting text to phonemes for Piper ☆134, updated 4 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆91, updated 2 years ago
- Speaker diarization model ☆28, updated 2 years ago
- ☆91, updated 2 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆107, updated 2 months ago
- ☆69, updated last month
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆408, updated last year
- An automatic speech recognition API ☆73, updated this week
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆97, updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119, updated 2 years ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆103, updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆242, updated 3 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆47, updated 2 years ago
- High quality text-to-speech based on StyleTTS 2. ☆70, updated 2 weeks ago
- C++ version of pyannote audio speaker diarization pipeline ☆22, updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆128, updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆102, updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆27, updated last year
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆155, updated last week