VITA-Group/AutoSpeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

☆206

Alternatives and similar repositories for AutoSpeech

Users that are interested in AutoSpeech are comparing it to the libraries listed below.

Jungjee / RawNet
Official repository for RawNet, RawNet2, and RawNet3
☆407 · Mar 21, 2024 · Updated 2 years ago
Snowdar / asv-subtools
Open Source Tools for Speaker Recognition
☆638 · Aug 5, 2024 · Updated last year
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆77 · Aug 15, 2021 · Updated 4 years ago
jymsuper / SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using PyTorch
☆211 · Jul 17, 2020 · Updated 6 years ago
seongmin-kye / meta-SR
PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆73 · Sep 16, 2020 · Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
manojpamk / pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321 · Nov 11, 2020 · Updated 5 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
google / speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453 · Aug 12, 2025 · Updated 11 months ago
a-nagrani / VoxSRC2020
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆43 · Jul 17, 2020 · Updated 6 years ago
yistLin / dvector
Speaker embedding (d-vector) trained with GE2E loss
☆289 · Jan 8, 2024 · Updated 2 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99 · Apr 20, 2020 · Updated 6 years ago
WeidiXie / VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
☆371 · Mar 24, 2023 · Updated 3 years ago
philipperemy / deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
☆941 · Apr 13, 2024 · Updated 2 years ago
qqueing / DeepSpeaker-pytorch
Speaker embedding (verification and recognition) using PyTorch
☆369 · Jul 24, 2020 · Updated 6 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
☆136 · Jan 27, 2020 · Updated 6 years ago
mycrazycracy / tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and TensorFlow
☆31 · Jun 30, 2020 · Updated 6 years ago
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,241 · Apr 28, 2021 · Updated 5 years ago
rhoposit / icassp2021
☆15 · May 8, 2021 · Updated 5 years ago
hitachi-speech / EEND
End-to-End Neural Diarization
☆435 · Aug 30, 2021 · Updated 4 years ago
wentaozhu / speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30 · Mar 24, 2023 · Updated 3 years ago
wq2012 / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,886 · Jul 7, 2026 · Updated 2 weeks ago
mechanicalsea / sugar
Efficient Speech Processing Toolkit for Automatic Speaker Recognition
☆17 · Feb 8, 2023 · Updated 3 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45 · Jul 10, 2019 · Updated 7 years ago
taylorlu / Speaker-Diarization
Speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
☆501 · Jul 1, 2021 · Updated 5 years ago
HarryVolek / PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
☆598 · Jan 20, 2022 · Updated 4 years ago
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,161 · Nov 24, 2025 · Updated 8 months ago
rishikksh20 / VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
☆321 · Jul 25, 2024 · Updated last year
DemisEom / SpecAugment
An implementation of SpecAugment (introduced by Google Brain) with TensorFlow & PyTorch
☆655 · Apr 5, 2022 · Updated 4 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
joonson / syncnet_trainer
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
☆167 · Apr 12, 2020 · Updated 6 years ago
wenet-e2e / speech-synthesis-paper
List of speech synthesis papers.
☆1,074 · Jul 24, 2023 · Updated 3 years ago
nonday / awesome-voiceprint
A curated list of awesome Voiceprint Recognition papers
☆19 · Jul 9, 2021 · Updated 5 years ago
facebookresearch / CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374 · Oct 12, 2021 · Updated 4 years ago
RicherMans / GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆141 · Aug 3, 2023 · Updated 2 years ago
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…
☆374 · Jun 18, 2025 · Updated last year