Picovoice / falconLinks
On-device speaker diarization powered by deep learning
☆52Updated 3 weeks ago
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆74Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆227Updated 2 weeks ago
- An automatic speech recognition API☆68Updated last week
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- ONNX Inference of Pyannote Segmentation☆92Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- ☆40Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆402Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 6 months ago
- OpenAI Whisper Prompt Examples☆52Updated 2 years ago
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆78Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆120Updated 3 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆94Updated 7 months ago
- Various speech datasets made available to the public☆128Updated 8 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆228Updated 2 weeks ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆427Updated 2 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 3 months ago
- ☆29Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆62Updated 3 years ago
- ☆81Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 10 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- ☆62Updated last year
- ☆57Updated 2 weeks ago
- Tunable pipelines☆36Updated 6 months ago