On-device speaker diarization powered by deep learning
☆69Mar 20, 2026Updated last week
Alternatives and similar repositories for falcon
Users that are interested in falcon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- benchmark for Speech-to-Intent engines☆17Dec 18, 2025Updated 3 months ago
- On-device noise suppression powered by deep learning☆84Mar 20, 2026Updated last week
- On-device speaker recognition engine powered by deep learning☆41Mar 20, 2026Updated last week
- Picovoice Browser Extension☆15Updated this week
- On-device streaming text-to-speech engine powered by deep learning☆133Mar 20, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Speaker diarization benchmark framework☆39Jan 8, 2026Updated 2 months ago
- On-device voice activity detection (VAD) powered by deep learning☆248Updated this week
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆35Aug 30, 2025Updated 6 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆16May 16, 2025Updated 10 months ago
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated 11 months ago
- ☆10Dec 22, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆170Dec 12, 2025Updated 3 months ago
- Write and keep snippets for VSCode in a markdown file.☆15Jul 23, 2023Updated 2 years ago
- ☆67Feb 8, 2024Updated 2 years ago
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 2 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Apr 19, 2023Updated 2 years ago
- Some comprehensive papers about speaker diarization☆338Updated this week
- ☆14Mar 15, 2024Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- On-device speech-to-text engine powered by deep learning☆477Mar 20, 2026Updated last week
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Overview of Icelandic NLP resources at a glance☆18Jun 20, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 5 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆155May 2, 2024Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆252Feb 10, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- ☆17Apr 14, 2023Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 10 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 10 months ago