CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆78Nov 9, 2019Updated 6 years ago
Alternatives and similar repositories for videoprocess
Users that are interested in videoprocess are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Open Source Tools for Speaker Recognition☆636Aug 5, 2024Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated 2 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- The implementation of g2pL with a new open dataset.☆16May 14, 2023Updated 2 years ago
- Kaldi model converter to ONNX☆247Jan 27, 2023Updated 3 years ago
- Collection of works from VIPL-AVSU☆50Mar 13, 2026Updated 2 weeks ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 3 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- ☆159Jan 9, 2023Updated 3 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Python package for combining diarization system outputs.☆93Oct 12, 2023Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- ☆32Mar 11, 2022Updated 4 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Oct 1, 2017Updated 8 years ago
- DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)☆301Jun 15, 2020Updated 5 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆106Mar 12, 2021Updated 5 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆399Mar 21, 2024Updated 2 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Oct 11, 2024Updated last year
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆119Nov 5, 2019Updated 6 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- ☆13Sep 21, 2022Updated 3 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Jun 24, 2023Updated 2 years ago
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,316Mar 9, 2026Updated 3 weeks ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆31Jun 30, 2020Updated 5 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆400Feb 4, 2019Updated 7 years ago