faroit/CountNet

Deep Neural Network for Speaker Count Estimation

☆157

Alternatives and similar repositories for CountNet

Users that are interested in CountNet are comparing it to the libraries listed below.

aishoot / Concurrent_Speakers_Counter
Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party" problem.
☆23 · Mar 4, 2020 · Updated 6 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13 · Oct 2, 2020 · Updated 5 years ago
LucasRr / Dictionary_learning_for_declipping_Python
Consistent dictionary learning algorithm for signal declipping (Python code)
☆20 · Oct 24, 2018 · Updated 7 years ago
popcornell / OSDC
☆18 · Jan 26, 2021 · Updated 5 years ago
itsnotacie / AAAI-26_SepPrune
SepPrune: Structured Pruning for Efficient Deep Speech Separation (AAAI'26)
☆15 · May 31, 2025 · Updated last year
yinruiqing / diarization_with_neural_approach
☆14 · Aug 9, 2018 · Updated 7 years ago
fgnt / pb_bss
Collection of EM algorithms for blind source separation of audio signals
☆305 · May 19, 2025 · Updated last year
Chutlhu / mirapie
Interference removal algorithm for multitrack live recordings
☆11 · Jan 9, 2019 · Updated 7 years ago
bekirbakar / replay-attack-detection
Deep learning-based audio spoofing attack detection experiments for speaker verification.
☆14 · Apr 20, 2023 · Updated 3 years ago
BornInWater / Overlap-Detection
Overlapped speech detection in multi-party conversations
☆22 · Feb 20, 2018 · Updated 8 years ago
csteinmetz1 / MixCNN
Convolutional Neural Network for multitrack mix leveling
☆19 · Jun 25, 2018 · Updated 8 years ago
yinruiqing / fsmn
Feedforward Sequential Memory Networks
☆18 · Aug 2, 2022 · Updated 3 years ago
etzinis / two_step_mask_learning
A two-step optimization for sound source separation on the adaptive front-end domain
☆70 · Sep 18, 2020 · Updated 5 years ago
facebookresearch / WavAugment
A library for speech data augmentation in the time domain
☆689 · Aug 30, 2021 · Updated 4 years ago
shashankshirol / GeneratingNoisySpeechData
A repository comprising code for generation of noisy speech data from clean data using deep learning methods
☆16 · Jul 12, 2021 · Updated 5 years ago
yinruiqing / change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆67 · Jul 14, 2020 · Updated 6 years ago
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 5 months ago
boozyguo / ClearWave
Speech denoising (speech enhancement) by deep learning, using Keras and TensorFlow
☆38 · Mar 21, 2018 · Updated 8 years ago
naplab / DANet
Deep Attractor Network (DANet) for single-channel speech separation
☆77 · Oct 1, 2018 · Updated 7 years ago
jwr1995 / WD-TCN
☆11 · Aug 5, 2022 · Updated 3 years ago
JorisCos / LibriMix
An open source dataset for source separation
☆502 · Feb 9, 2024 · Updated 2 years ago
funcwj / deep-clustering
Deep clustering method for single-channel speech separation
☆110 · Jun 21, 2022 · Updated 4 years ago
sagiebenaim / Singing
☆19 · May 9, 2019 · Updated 7 years ago
mahimg / Speaker-recognition
Segment speech sequences based on speaker transitions, using ML and DSP.
☆17 · Jul 30, 2018 · Updated 7 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 · Apr 8, 2022 · Updated 4 years ago
jordipons / neural-classifiers-with-few-audio
Training neural audio classifiers with few data: https://arxiv.org/abs/1810.10274
☆60 · Feb 1, 2019 · Updated 7 years ago
hitachi-speech / EEND
End-to-End Neural Diarization
☆435 · Aug 30, 2021 · Updated 4 years ago
bootphon / learnable-strf
Learnable STRF, from Riad et al. 2021 JASA
☆13 · Aug 21, 2021 · Updated 4 years ago
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
☆432 · Jul 6, 2023 · Updated 3 years ago
google / uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588 · Sep 25, 2024 · Updated last year
Andong-Li-speech / GaGNet
This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …
☆72 · Feb 10, 2022 · Updated 4 years ago
Js-Mim / aes_wimp
Support material and source code for the system described in: "New Sonorities for Jazz Recordings: Separation and Mixing using Deep Neu…
☆13 · Jul 19, 2017 · Updated 9 years ago
popcornell / SparseLibriMix
☆73 · Feb 15, 2021 · Updated 5 years ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆105 · Nov 18, 2015 · Updated 10 years ago
aishoot / LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
☆311 · Jan 6, 2022 · Updated 4 years ago
wq2012 / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,886 · Jul 7, 2026 · Updated 3 weeks ago
gemengtju / SpEx_Plus
SpEx+(tied) source code
☆96 · Jul 6, 2023 · Updated 3 years ago
ina-foss / inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender …
☆902 · Mar 12, 2026 · Updated 4 months ago
asteroid-team / asteroid
The PyTorch-based audio source separation toolkit for researchers
☆2,579 · May 13, 2026 · Updated 2 months ago