felixfuyihui / AISHELL-4
☆114Updated 3 years ago
Related projects: ⓘ
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆111Updated 2 years ago
- SpEx+(tied) source code☆72Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆53Updated 3 years ago
- ☆50Updated 3 years ago
- Conferencing Speech Challenge☆89Updated 3 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆163Updated 2 years ago
- ☆69Updated 3 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆71Updated last year
- ☆64Updated 2 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆42Updated 9 months ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆139Updated last year
- A python IO interface for data accessing in kaldi☆38Updated 3 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆119Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆43Updated 5 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆98Updated last year
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆119Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆138Updated last year
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆108Updated last year
- ☆29Updated 2 years ago
- Libri-CSS: dataset and evaluation pipeline☆130Updated last year
- ☆63Updated this week
- A unofficial Pytorch implementation of Microsoft's PHASEN☆221Updated 5 months ago
- ☆32Updated last month
- Python package for combining diarization system outputs.☆73Updated 11 months ago
- target speaker extraction and verification for multi-talker speech☆155Updated 3 years ago
- ☆97Updated 3 years ago
- ☆141Updated 4 years ago
- This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》☆35Updated 3 years ago
- Moved to https://github.com/k2-fsa/icefall☆143Updated last year
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆72Updated 2 years ago