Robofied / VoicenetView external linksLinks
Comprehensive Python library for speech and voice.
☆32Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for Voicenet
Users that are interested in Voicenet are comparing it to the libraries listed below
Sorting:
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated this week
- ☆19Sep 20, 2024Updated last year
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- Consistent dictionary learning algorithm for signal declipping (Python code)☆20Oct 24, 2018Updated 7 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- Pronounce Arabic words☆19May 27, 2019Updated 6 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆32Apr 2, 2025Updated 10 months ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- ☆17May 30, 2018Updated 7 years ago
- A Playground for Variational Autoencoders☆12Feb 11, 2018Updated 8 years ago
- ☆11Nov 7, 2024Updated last year
- VoxAngeles Corpus☆13Aug 23, 2025Updated 5 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- ☆52Sep 10, 2024Updated last year
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆15Nov 11, 2024Updated last year
- ☆13Jan 14, 2025Updated last year
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)☆11Aug 12, 2020Updated 5 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Oct 12, 2020Updated 5 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- ☆15Nov 10, 2025Updated 3 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago