Picovoice / falconLinks
On-device speaker diarization powered by deep learning
☆52Updated 3 weeks ago
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- ONNX Inference of Pyannote Segmentation☆92Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆101Updated 10 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 6 months ago
- ☆48Updated 3 weeks ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆118Updated 2 months ago
- An automatic speech recognition API☆66Updated 3 weeks ago
- OpenAI Whisper Prompt Examples☆52Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆65Updated 2 weeks ago
- ☆40Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 2 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆71Updated 4 months ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆222Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆121Updated this week
- Tunable pipelines☆35Updated 5 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- ☆78Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆174Updated last year
- High quality text-to-speech based on StyleTTS 2.☆59Updated this week
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆97Updated 2 months ago
- ☆116Updated 2 weeks ago
- Audio tokenization, in the fastest way possible!☆52Updated 11 months ago