On-device speaker diarization powered by deep learning
☆72May 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- benchmark for Speech-to-Intent engines☆18Mar 27, 2026Updated 2 months ago
- On-device noise suppression powered by deep learning☆88Updated this week
- On-device speaker recognition engine powered by deep learning☆46Updated this week
- Picovoice Browser Extension☆16May 11, 2026Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆139Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learning☆254Updated this week
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 9 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆179May 7, 2026Updated 3 weeks ago
- Write and keep snippets for VSCode in a markdown file.☆15Jul 23, 2023Updated 2 years ago
- Voice activity engine benchmark framework☆23Jan 14, 2026Updated 4 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Apr 19, 2023Updated 3 years ago
- ☆67Feb 8, 2024Updated 2 years ago
- Some comprehensive papers about speaker diarization☆357Mar 24, 2026Updated 2 months ago
- ☆14Mar 15, 2024Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆92Jul 23, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- On-device speech-to-text engine powered by deep learning☆480Updated this week
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Overview of Icelandic NLP resources at a glance☆18Jun 20, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆157May 2, 2024Updated 2 years ago
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆265Apr 19, 2026Updated last month
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆30May 22, 2026Updated last week
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆86Jun 17, 2025Updated 11 months ago
- ☆17Apr 14, 2023Updated 3 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated last year