IDRnD / VoxTube
The VoxTube dataset official repository
☆65Updated 11 months ago
Alternatives and similar repositories for VoxTube:
Users that are interested in VoxTube are comparing it to the libraries listed below
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆47Updated this week
- Official Repository For VoxBlink2☆59Updated 5 months ago
- Reference-aware automatic speech evaluation toolkit☆140Updated last month
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆24Updated 9 months ago
- UTokyo-SaruLab MOS Prediction System☆129Updated last month
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆136Updated 2 months ago
- A simple package for Guided source separation (GSS)☆112Updated 8 months ago
- Python package for combining diarization system outputs.☆83Updated last year
- This is the M-AILABS Speech Dataset☆38Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆65Updated 3 months ago
- UT-Sarulab MOS prediction system using SSL models☆202Updated 9 months ago
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆109Updated last year
- ☆55Updated 8 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆120Updated 7 months ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆150Updated 2 years ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆123Updated 4 months ago
- ☆88Updated last year
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆72Updated 8 months ago
- ☆57Updated 11 months ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆73Updated 7 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆185Updated 4 months ago
- ☆63Updated 4 months ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆133Updated last year
- ☆65Updated last week
- A PyTorch implementation of End-to-End Neural Diarization☆101Updated last year
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆109Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆78Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆84Updated 2 months ago