Picovoice / falcon
On-device speaker diarization powered by deep learning
☆55 · Updated 2 months ago
Alternatives and similar repositories for falcon
Users who are interested in falcon are comparing it to the libraries listed below:
- On-device voice activity detection (VAD) powered by deep learning ☆229 · Updated 2 weeks ago
- On-device noise suppression powered by deep learning ☆74 · Updated 2 months ago
- A curated list of awesome voice activity detection ☆66 · Updated 10 months ago
- Very fast, accurate speaker diarization ☆145 · Updated last week
- ONNX Inference of Pyannote Segmentation ☆93 · Updated 9 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆150 · Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆88 · Updated last year
- ☆43 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 · Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆107 · Updated 3 weeks ago
- An automatic speech recognition API ☆70 · Updated 2 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆126 · Updated 2 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆97 · Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 · Updated last month
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆404 · Updated last year
- ☆203 · Updated 2 months ago
- ☆86 · Updated last week
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆92 · Updated 6 months ago
- ☆61 · Updated this week
- On-device streaming text-to-speech engine powered by deep learning ☆122 · Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer ☆53 · Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆213 · Updated 5 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆143 · Updated 2 weeks ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆47 · Updated 2 years ago
- OpenAI Whisper Prompt Examples ☆52 · Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆237 · Updated last month
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆170 · Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 · Updated 3 weeks ago
- Various speech datasets made available to the public ☆131 · Updated 9 months ago
- Speaker Diarization with Transformers ☆69 · Updated 4 months ago