mycrazycracy / speaker-embedding-with-phonetic-informationLinks
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45Updated 6 years ago
Alternatives and similar repositories for speaker-embedding-with-phonetic-information
Users that are interested in speaker-embedding-with-phonetic-information are comparing it to the libraries listed below
Sorting:
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 3 years ago
- ☆37Updated 4 years ago
- ☆32Updated 3 years ago
- This repository will illustrate the use of some different backends on NIST SRE 2019.☆20Updated 5 years ago
- MultiSV: scripts for data preparation☆27Updated 9 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 3 years ago
- SpEx+(tied) source code☆88Updated 2 years ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆145Updated 2 years ago
- Discriminative Condition-Aware PLDA☆44Updated last year
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆119Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆58Updated last year
- A simple package for Guided source separation (GSS)☆130Updated last year
- ☆31Updated 3 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Updated 5 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- ☆59Updated last year
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Updated 5 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆123Updated 2 years ago
- Official repository of our paper: https://arxiv.org/abs/2010.15366☆63Updated 4 years ago
- ☆104Updated 4 years ago
- STOI loss function in PyTorch☆99Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆82Updated 4 years ago
- Conferencing Speech Challenge☆95Updated 4 years ago
- multi-scale time domain speaker extraction☆67Updated 4 years ago
- ☆49Updated 5 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆81Updated 4 months ago