mit-ll / PSIAP-DL-YouTube-CC
Python script to download all Creative Commons licensed videos from a YouTube channel
☆13 · Updated 4 years ago
Alternatives and similar repositories for PSIAP-DL-YouTube-CC
Users who are interested in PSIAP-DL-YouTube-CC are comparing it to the repositories listed below
- Data processing tools for preparing speech and labels for training TTS voices ☆27 · Updated 4 years ago
- Anonymous ICLR Submission ☆14 · Updated 5 years ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 · Updated 6 years ago
- A database of clean and noisy speech for audio research ☆9 · Updated 7 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck ☆14 · Updated 5 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow ☆21 · Updated 7 years ago
- ☆8 · Updated 7 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook ☆34 · Updated 7 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated 2 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman. ☆51 · Updated 6 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders ☆121 · Updated 2 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Updated 3 years ago
- lazy_dataset: Process large datasets as if it was an iterable. ☆18 · Updated 6 months ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment. ☆13 · Updated 2 years ago
- Source code for INTERSPEECH2020 ☆11 · Updated 4 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 · Updated 10 years ago
- You Said That?: Synthesising Talking Faces from Audio ☆69 · Updated 7 years ago
- Basic wavenet and fftnet vocoder model. ☆19 · Updated 3 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models" ☆11 · Updated 6 years ago
- ☆23 · Updated 2 years ago
- ESPnet-TTS Audio Sample HP ☆21 · Updated 5 years ago
- A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT l… ☆80 · Updated 2 months ago
- ☆26 · Updated 3 years ago
- Tensor2tensor experiment with SpecAugment ☆46 · Updated 6 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆23 · Updated 11 months ago
- Coqui Inference Engine ☆40 · Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR ☆34 · Updated 4 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 · Updated 4 years ago
- ☆21 · Updated 7 years ago