IDRnD / VoxTube
The VoxTube dataset official repository
☆68Updated last year
Alternatives and similar repositories for VoxTube:
Users that are interested in VoxTube are comparing it to the libraries listed below
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆25Updated 11 months ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆149Updated 4 months ago
- Official Repository For VoxBlink2☆64Updated 7 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆50Updated last month
- Reference-aware automatic speech evaluation toolkit☆144Updated 3 months ago
- ☆57Updated 10 months ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- ☆52Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 7 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆151Updated 8 months ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- UTokyo-SaruLab MOS Prediction System☆155Updated 3 weeks ago
- A simple package for Guided source separation (GSS)☆117Updated 10 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆132Updated 2 years ago
- ☆51Updated 4 months ago
- ☆64Updated 6 months ago
- Official repository of NeXt-TDNN for speaker verification☆68Updated 5 months ago
- ☆91Updated last year
- ☆43Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆86Updated 4 months ago
- UT-Sarulab MOS prediction system using SSL models☆216Updated 11 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆127Updated 9 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆99Updated 8 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆191Updated 6 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆77Updated 11 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆51Updated last month
- ☆61Updated last year
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆74Updated 9 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆71Updated 3 months ago