Picovoice / falconLinks
On-device speaker diarization powered by deep learning
☆66Updated 3 weeks ago
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below
Sorting:
- On-device noise suppression powered by deep learning☆82Updated 3 weeks ago
- A curated list of awesome voice activity detection☆71Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆242Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- ☆46Updated 2 years ago
- Speaker diarization model☆32Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Updated 2 years ago
- Very fast, accurate speaker diarization☆228Updated this week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆128Updated 2 weeks ago
- C++ library for converting text to phonemes for Piper☆139Updated 7 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆113Updated 2 months ago
- ☆476Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆250Updated this week
- ☆82Updated 2 weeks ago
- An automatic speech recognition API☆79Updated last week
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆150Updated 2 weeks ago
- OpenAI Whisper Prompt Examples☆53Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆413Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆104Updated 5 months ago
- Uses machine learning to denoise audio containing speech☆50Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- ☆100Updated this week
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆48Updated 4 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆54Updated last month
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆68Updated last week