visual-text to speech
☆14 · Apr 3, 2022 · Updated 3 years ago
Alternatives and similar repositories for visual-text-to-speech
Users who are interested in visual-text-to-speech are comparing it to the libraries listed below.
- RWCP-SSD-Onomatopoeia ☆23 · Jun 28, 2023 · Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- ☆21 · Jun 16, 2021 · Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- Official implementation of Self-Remixing ☆17 · Feb 3, 2024 · Updated 2 years ago
- PAVOQUE Corpus of Expressive Speech ☆12 · Aug 2, 2016 · Updated 9 years ago
- Segmentation support tool using Julius ☆13 · Feb 13, 2020 · Updated 6 years ago
- A data collection and processing pipeline for animal video, annotations include mask, keypoint, depth, occlusion, etc. Suitable for 3D/4D… ☆51 · Dec 5, 2025 · Updated 3 months ago
- Examination Questions in the Dept. of Computer Science and Electronic Engineering. ☆11 · Apr 2, 2025 · Updated 11 months ago
- ☆18 · Feb 9, 2020 · Updated 6 years ago
- SelfRemaster: SSL Speech Restoration ☆94 · Jan 5, 2024 · Updated 2 years ago
- ☆15 · Nov 13, 2025 · Updated 4 months ago
- Python package for the Zero Speech Challenge 2020 ☆14 · Feb 5, 2021 · Updated 5 years ago
- A tool for aligning phoneme labels to Japanese speech ☆38 · Aug 19, 2025 · Updated 7 months ago
- Network specification and demo ☆35 · Jun 5, 2017 · Updated 8 years ago
- Bilingual Singing Voice Synthesis ☆18 · Mar 25, 2024 · Updated 2 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck ☆14 · Apr 29, 2020 · Updated 5 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE" ☆44 · Apr 10, 2023 · Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- My notes and work for the Bayesian Data Analysis course taught by Aki Vehtari. ☆18 · Oct 29, 2022 · Updated 3 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning ☆83 · Nov 4, 2022 · Updated 3 years ago
- ☆18 · Dec 7, 2023 · Updated 2 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆17 · May 24, 2020 · Updated 5 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021 ☆40 · Jul 17, 2021 · Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Dec 10, 2020 · Updated 5 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/ ☆34 · Mar 17, 2023 · Updated 3 years ago
- ☆20 · Mar 16, 2020 · Updated 6 years ago
- TPSE-GST Tacotron2 ☆14 · May 1, 2019 · Updated 6 years ago
- ☆42 · Mar 25, 2022 · Updated 4 years ago
- Implementation of Harmonic Convolution by Harmonic Lowering ☆17 · Nov 11, 2020 · Updated 5 years ago
- ICASSP 2021 accepted papers on voice conversion (VC) ☆18 · Apr 11, 2021 · Updated 4 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech" ☆22 · Jul 5, 2023 · Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆34 · Jul 31, 2024 · Updated last year
- A Machine Learning Approach for the Diagnosis of Parkinson's Disease via Speech Analysis ☆20 · Dec 27, 2020 · Updated 5 years ago
- Berlin Bayesians' solutions to Bayesian Data Analysis, 3rd edition. ☆21 · Sep 21, 2019 · Updated 6 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆24 · Mar 29, 2021 · Updated 5 years ago
- xvector model on jtubespeech ☆47 · Nov 5, 2023 · Updated 2 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS ☆169 · Apr 10, 2024 · Updated last year
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files! ☆32 · Jun 6, 2020 · Updated 5 years ago