usc-sail / mica-speech-activity-detection

Robust Speech Activity Detection (SAD) in movie audio

☆26

Alternatives and similar repositories for mica-speech-activity-detection

Users who are interested in mica-speech-activity-detection are comparing it to the repositories listed below.

iariav / End-to-End-VAD
Audio-visual voice activity detection using deep learning
☆50 Updated 6 years ago
RicherMans / GPV
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
☆141 Updated 2 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆60 Updated 5 years ago
a-nagrani / VoxSRC2020
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆42 Updated 5 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆94 Updated 2 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 Updated 3 years ago
celebrity-audio-collection / videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆74 Updated 6 years ago
HuangZiliAndy / RPNSD
PyTorch implementation of RPNSD
☆60 Updated last year
felixkreuk / SegFeat
Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)
☆82 Updated 4 years ago
funcwj / voice-filter
An unofficial PyTorch implementation of Google's VoiceFilter
☆102 Updated 2 years ago
lilianemomeni / KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
☆65 Updated 5 years ago
staplesinLA / denoising_DIHARD18
☆60 Updated 5 years ago
dr-pato / audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆110 Updated last year
funcwj / ge2e-speaker-verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103 Updated 6 years ago
mravanelli / pySpeechRev
This Python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆96 Updated 5 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 Updated 6 years ago
nttcslab-sp / agevoxceleb
☆27 Updated 3 years ago
luan78zaoha / kaldi-timit-sre-ivector
Develop a speaker recognition model based on i-vectors using the TIMIT database
☆16 Updated 6 years ago
RaviSoji / plda
Probabilistic Linear Discriminant Analysis & classification, written in Python.
☆129 Updated 3 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25 Updated 6 years ago
DonkeyShot21 / uis-rnn-sml
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆62 Updated 5 years ago
felixkreuk / UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆143 Updated 3 years ago
nii-yamagishilab / Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…
☆56 Updated 2 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆100 Updated 5 years ago
danFromTelAviv / key_words_spotting
Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating"
☆36 Updated 5 years ago
fgnt / sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆124 Updated last year
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23 Updated 5 years ago
joaoantoniocn / AM-SincNet
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…
☆45 Updated 2 years ago
foamliu / Speaker-Embeddings
PyTorch implementation of a self-attentive speaker embedding
☆17 Updated 6 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
☆136 Updated 5 years ago