Picovoice / falconLinks
On-device speaker diarization powered by deep learning
☆53Updated last month
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆74Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆228Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆148Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆86Updated last year
- ☆41Updated last year
- A curated list of awesome voice activity detection☆62Updated 10 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- An automatic speech recognition API☆68Updated 3 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last week
- ☆60Updated 2 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Very fast, accurate speaker diarization☆93Updated last week
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆80Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 11 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆95Updated 8 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆84Updated 5 months ago
- Simple diarization model☆52Updated 3 months ago
- OpenAI Whisper Prompt Examples☆52Updated 2 years ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆47Updated 2 years ago
- ☆82Updated 3 months ago
- ☆64Updated last year
- ☆143Updated last month
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning☆120Updated last week
- ☆127Updated 3 weeks ago