mit-ll / PSIAP-DL-YouTube-CC
Python script to download all Creative Commons licensed videos from a Youtube channel
☆13Updated 4 years ago
Alternatives and similar repositories for PSIAP-DL-YouTube-CC
Users that are interested in PSIAP-DL-YouTube-CC are comparing it to the libraries listed below
Sorting:
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms.☆16Updated 9 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality☆21Updated 5 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆17Updated 5 years ago
- Python library for audio augmentation☆84Updated last year
- A database of clean and noisy speech for audio research☆9Updated 7 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆22Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- Tools for Ahocoder data processing and evaluation metrics☆14Updated last year
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Creates video from TTS output and viseme images.☆11Updated 2 years ago
- ☆26Updated 3 years ago
- A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate t…☆19Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 9 months ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- A collection of basic python modules for spoken natural language processing☆57Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- ☆12Updated 4 years ago
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago
- ☆8Updated 7 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Deep learning using CNN for Mandarin Chinese tone classification☆35Updated 6 years ago
- gentle forced aligner☆11Updated last year
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆106Updated 3 years ago
- ☆24Updated 6 years ago