jim-schwoebel / voiceome
The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
⭐32 · Apr 2, 2025 · Updated 10 months ago
Alternatives and similar repositories for voiceome
Users that are interested in voiceome are comparing it to the libraries listed below
- ⭐10 · Mar 20, 2021 · Updated 4 years ago
- ⭐45 · Oct 24, 2020 · Updated 5 years ago
- Deepspeech ASR Model for the Catalan Language ⭐17 · Feb 15, 2021 · Updated 5 years ago
- ICASSP2022 TTS&VC Summary ⭐14 · Jun 9, 2022 · Updated 3 years ago
- Comprehensive Python library for speech and voice. ⭐32 · Dec 8, 2022 · Updated 3 years ago
- Face Research Toolkit ⭐17 · Jul 9, 2025 · Updated 7 months ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a… ⭐21 · Jan 26, 2020 · Updated 6 years ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files ⭐46 · Dec 27, 2022 · Updated 3 years ago
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis ⭐20 · Jan 28, 2020 · Updated 6 years ago
- CMU dictionary in IPA instead of their subset of Arpabet ⭐16 · Sep 24, 2024 · Updated last year
- ESPnet-TTS Audio Sample HP ⭐21 · Oct 25, 2019 · Updated 6 years ago
- eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents. ⭐24 · Nov 29, 2023 · Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ⭐45 · Jun 29, 2021 · Updated 4 years ago
- Surrey CVSSP DCASE 2018 Task 2 system ⭐20 · Dec 26, 2022 · Updated 3 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar. ⭐87 · Aug 17, 2020 · Updated 5 years ago
- video cut powered by AI ⭐24 · Nov 15, 2022 · Updated 3 years ago
- An unofficial PyTorch implementation of Google's VoiceFilter ⭐103 · Jul 6, 2023 · Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ⭐61 · Feb 2, 2023 · Updated 3 years ago
- Covering grammars for English and Russian text normalization ⭐61 · Sep 15, 2019 · Updated 6 years ago
- ⭐28 · Oct 7, 2025 · Updated 4 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" ⭐103 · Mar 18, 2019 · Updated 6 years ago
- Transformer for ASR system (via tensorflow2.0) ⭐114 · May 7, 2019 · Updated 6 years ago
- GlottDNN vocoder and tools for training DNN excitation models ⭐32 · Feb 27, 2021 · Updated 4 years ago
- This is a GitHub repository of the abandonware Sequitur G2P by Bisani & Ney ⭐175 · Dec 16, 2025 · Updated 2 months ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou… ⭐62 · May 13, 2020 · Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ⭐35 · Aug 31, 2020 · Updated 5 years ago
- The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues. ⭐33 · Nov 29, 2018 · Updated 7 years ago
- A hand-gesture recognition system using the Doppler effect of ultrasound. ⭐11 · Mar 2, 2019 · Updated 6 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database. ⭐10 · Mar 19, 2019 · Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio… ⭐36 · Apr 25, 2025 · Updated 9 months ago
- A bot to automatically take surveys on the Pulse website ⭐14 · Mar 6, 2022 · Updated 3 years ago
- ⭐11 · Jun 20, 2023 · Updated 2 years ago
- real time face swap and one-click video deepfake with only a single image ⭐11 · Sep 13, 2024 · Updated last year
- Text frontend for ESPnet tts recipes ⭐34 · Jun 1, 2021 · Updated 4 years ago
- ⭐10 · Jul 14, 2022 · Updated 3 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper) ⭐36 · Aug 31, 2021 · Updated 4 years ago
- Automatic Speech Recognition Dataset Generation ⭐37 · Aug 26, 2018 · Updated 7 years ago
- Formant Tracking & Estimation ⭐79 · Dec 15, 2024 · Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR ⭐220 · Jan 14, 2021 · Updated 5 years ago