celebrity-audio-collection / videoprocessView external linksLinks
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆77Nov 9, 2019Updated 6 years ago
Alternatives and similar repositories for videoprocess
Users that are interested in videoprocess are comparing it to the libraries listed below
Sorting:
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- An Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- Collection of works from VIPL-AVSU☆50Aug 2, 2025Updated 6 months ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- Kaldi model converter to ONNX☆247Jan 27, 2023Updated 3 years ago
- This is now the official location of the Kaldi project.☆13Jun 10, 2019Updated 6 years ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- Tools for Ahocoder data processing and evaluation metrics☆15Apr 22, 2024Updated last year
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- The implementation of g2pL with a new open dataset.☆16May 14, 2023Updated 2 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Nov 4, 2020Updated 5 years ago
- ☆157Jan 9, 2023Updated 3 years ago
- ☆106Mar 12, 2021Updated 4 years ago
- An effort to track benchmarking results over widely-used datasets for ASR.☆52Dec 19, 2025Updated last month
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆120Nov 5, 2019Updated 6 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Jul 6, 2023Updated 2 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)☆18Jul 12, 2019Updated 6 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆32Jun 30, 2020Updated 5 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Jun 24, 2023Updated 2 years ago