wenet-e2e / wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆670Updated this week
Related projects: ⓘ
- An Open Source Tools for Speaker Recognition☆590Updated last month
- ☆894Updated last week
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆587Updated 5 months ago
- Large, modern dataset for speech recognition☆629Updated 6 months ago
- The dataset of Speech Recognition☆382Updated 2 months ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆436Updated last month
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆658Updated 6 months ago
- Tools for handling speech data in machine learning projects.☆932Updated this week
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,108Updated last week
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆367Updated 3 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆629Updated last month
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆419Updated 5 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆887Updated last year
- End-to-End Neural Diarization☆367Updated 3 years ago
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆426Updated last week
- Variational Bayes HMM over x-vectors diarization☆251Updated 8 months ago
- In defence of metric learning for speaker recognition☆1,027Updated 5 months ago
- A must-read paper for speech separation based on neural networks☆735Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆348Updated last year
- Some comprehensive papers about speaker diarization☆190Updated last month
- Towards hot directions in industrial end to end speech recognition☆324Updated 2 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆464Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆303Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆576Updated 2 years ago
- [ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training fo…☆576Updated last week
- Chinese text normalization for speech processing☆620Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆345Updated this week
- Conformer-based Metric GAN for speech enhancement☆297Updated 4 months ago
- ☆393Updated 11 months ago
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…☆344Updated 7 months ago