ylongqi / podcast-data-modeling
More than Just Words: Modeling Non-textual Characteristics of Podcasts
☆26Updated 5 years ago
Alternatives and similar repositories for podcast-data-modeling:
Users that are interested in podcast-data-modeling are comparing it to the libraries listed below
- ☆48Updated 2 years ago
- ☆32Updated 4 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆14Updated 4 months ago
- ☆20Updated 6 years ago
- Text normalization scripts from IRISA lab☆13Updated 6 years ago
- Embedded segmental K-means (ES-KMeans) in Python.☆14Updated last year
- Companion tutorials for "An Introduction to Singing Voice Analysis", published Jan 2019☆56Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 4 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago
- ☆22Updated 2 years ago
- ☆45Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Code for paper submission under review.☆33Updated 7 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.☆54Updated 4 years ago
- Codebase and utilities for using models trained by multiple music related tasks☆13Updated last year
- Benchmark popular audio i/o packages☆140Updated last year
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- Autoregressive HMM version of the HTS demo for statistical speech synthesis (includes autoregressive clustering)☆16Updated 10 years ago
- CNN-based singing voice detection experiments☆37Updated 7 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- ☆12Updated 3 years ago
- Self-contained Python package for OpenFst☆51Updated 2 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆41Updated 5 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated last year
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆49Updated 5 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago
- ☆33Updated 5 years ago